Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
Jun 1, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
The open source high performance ELT framework powered by Apache Arrow
Categorical Query Language IDE
🧙 Build, run, and manage data pipelines for integrating and transforming data.
An orchestration platform for the development, production, and observation of data assets.
CloudQuery Go SDK for source and destination plugins
Upserts, Deletes And Incremental Processing on Big Data.
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.
Hop Orchestration Platform
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Business Automations is a collection of automations built to enhance productivity, increase revenue, and reduce manual data manipulation at a retail store location that integrates a NCR Counterpoint SQL database with the BigCommerce e-commerce platform.
Perform historical snapshots without database locks and read change data capture logs from databases. Artie Reader is compatible with Debezium and written in Go.
Lean and mean distributed stream processing system written in rust and web assembly.
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Automates the collection, transformation, and presentation of web analytics data from Google Analytics 4 and Google Search Console into Google Sheets for streamlined reporting and analysis.
An Efficient RML-Compliant Engine for Knowledge Graph Construction
Turns Data and AI algorithms into production-ready web applications in no time.
Flink CDC is a streaming data integration tool
Build REST APIs/Integrations in minutes instead of hours - NF Compose is a (data) integration platform that allows developers to define REST APIs in seconds instead of hours. Generated REST APIs are backed by postgres and support automatic consumer webhook notifications on data changes out of the box.
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."