hadoop
Here are 3,322 public repositories matching this topic...
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Updated May 11, 2024 - Scala
Alluxio, data orchestration for analytics and machine learning in the cloud
Updated May 11, 2024 - Java
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
Updated May 11, 2024 - Java
Updated May 11, 2024 - Jupyter Notebook
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Updated May 11, 2024 - Java
This project aims to address Egypt's energy challenges by leveraging data-driven solutions. With increasing demand from urban centers and industries, conventional approaches such as random power cuts have proven ineffective. To tackle this issue, we are adopting a proactive strategy grounded in data analytics.
Updated May 10, 2024 - Jupyter Notebook
Library for per-file client-side encryption in Hadoop FileSystems such as HDFS or S3.
Updated May 10, 2024 - Java
Scalable data processing pipelines in JavaScript
Updated May 10, 2024 - TypeScript
Updated May 10, 2024 - Java
Scalable, redundant, and distributed object store for Apache Hadoop
Updated May 11, 2024 - Java
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
Updated May 10, 2024 - Java