hiveql
Here are 161 public repositories matching this topic...
Problems on Hadoop-MapReduce, Hive and PySparkSQL
Updated Dec 14, 2022 - Java
Coursework for the Hadoop course, completed by Group 5
Updated Aug 18, 2022 - Shell
This project analyzes the "User, Occupation, Movies, and Ratings" dataset using Apache Hive. The data is processed and analyzed with Hive's SQL-like query language on the MapReduce framework, which makes large datasets easier to handle. The focus of the analysis was to provide a comprehensive breakdown of the data
Updated Jan 31, 2023
Updated Mar 6, 2019 - Python
Notes documenting how to perform DML operations in Hive.
Updated Oct 18, 2020 - HiveQL
Real-Time Streaming: Twitter Data Pipeline Using Big Data Tools
Updated May 8, 2023 - Python
A series of exercises and projects on big data processing with Hadoop, HBase, Hive, and Spark using Python. The projects are hosted on AWS EMR and demonstrate data handling and processing techniques in the cloud.
Updated May 14, 2024
Processing and transforming data via Hadoop Ecosystem
Updated Nov 26, 2020 - Python
Joining, cleaning, querying, and performing ETL on a Twitter posts dataset.
Updated Jun 11, 2020 - Python
Analytics on COVID data from the ECDC website using Azure services: ADF, Databricks, and HDInsight.
Updated May 8, 2024 - PowerShell
Updated May 28, 2022