etl
Here are 3,710 public repositories matching this topic...
Flink CDC is a streaming data integration tool
-
Updated
Jun 3, 2024 - Java
A machine-readable, human-editable database of the Yu-Gi-Oh! Trading Card Game, Official Card Game, Master Duel, Rush Duel, Speed Duel.
-
Updated
Jun 3, 2024 - Python
Scripts to extract, transform, and load Los Angeles Yelp data from the Yelp Fusion API and Kaggle.
-
Updated
Jun 3, 2024 - Jupyter Notebook
🧙 Build, run, and manage data pipelines for integrating and transforming data.
-
Updated
Jun 3, 2024 - Python
An orchestration platform for the development, production, and observation of data assets.
-
Updated
Jun 3, 2024 - Python
Production Grade Nifi & Nifi Registry. Deploy for VM (Virtual Machine) with Terraform + Ansible, Helm & Helmfile for Kubernetes (EKS)
-
Updated
Jun 3, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
Jun 3, 2024 - Python
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
-
Updated
Jun 3, 2024 - Go
Wikidata and Wikipedia language data extraction
-
Updated
Jun 2, 2024 - Python
Various projects on applications of Data Science and Machine Learning
-
Updated
Jun 2, 2024 - Jupyter Notebook
A generic workflow and processing engine which can be configured to do a large variety of tasks.
-
Updated
Jun 2, 2024 - PHP
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
Jun 2, 2024 - Python
Dag server based on quartz, allows to execute batch processes modeled as DAG (Direct Acyclic graph). Inspired by Apache Airflow and IBM Datastage
-
Updated
Jun 2, 2024 - JavaScript
Efficient data transformation and modeling framework that is backwards compatible with dbt.
-
Updated
Jun 2, 2024 - Python
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
-
Updated
Jun 2, 2024 - Java
Pull and standardize data on cloud compute resources.
-
Updated
Jun 2, 2024 - Python
Improve this page
Add a description, image, and links to the etl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."