big-data
Here are 4,019 public repositories matching this topic...
BlockMesh, is an innovative, open and secure network that allows you to easily monetize your excess bandwidth. Giving you a great opportunity to passively profit and participate in the frontline of AI data layer, online privacy, open source and blockchain industries.
-
Updated
Jun 2, 2024 - JavaScript
Apache DataFusion SQL Query Engine
-
Updated
Jun 2, 2024 - Rust
ClickHouse® is a real-time analytics DBMS
-
Updated
Jun 2, 2024 - C++
curated list of awesome tools and libraries for specific domains
-
Updated
Jun 2, 2024
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
-
Updated
Jun 2, 2024 - Python
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
-
Updated
Jun 2, 2024 - Java
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
Jun 2, 2024 - Java
The Open Source Feature Store for Machine Learning
-
Updated
Jun 2, 2024 - Python
ELTL pipeline to monitor air quality in the Paris Île-de-France area
-
Updated
Jun 2, 2024 - Python
A time series database for storing and managing large amounts of blob data
-
Updated
Jun 2, 2024 - Rust
🚄 FASTJSON2 is a Java JSON library with excellent performance.
-
Updated
Jun 2, 2024 - Java
YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
Jun 2, 2024 - C++
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
-
Updated
Jun 2, 2024 - C++
The most widely used Python to C compiler
-
Updated
Jun 2, 2024 - Python
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
-
Updated
Jun 2, 2024 - Go
Scalable, redundant, and distributed object store for Apache Hadoop
-
Updated
Jun 2, 2024 - Java
Improve this page
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."