Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
Jun 1, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
This is the process of automatically extracting data from websites. This can include text, images, and other media types from various web pages.
Workflow Engine for Kubernetes
AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR
Production Grade Terraform for Provisioning Infrastructure
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
TU Libraries Airflow & DAGs Deployment Ansible Role
Airflow DAGs for the Stellar ETL project
🌎 terraform provider for double.cloud
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Material correspondiente a AMq2
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
This project implements an image annotation pipeline using AWS, Apache Kafka and airflow, featuring a Flask web application for labeling images. The annotated images are automatically uploaded to an S3 bucket, ensuring efficient and scalable data processing.
Distributed run of dbt models using Airflow
A plugin for Apache Airflow that allows you to edit DAGs in browser
Add a description, image, and links to the airflow topic page so that developers can more easily learn about it.
To associate your repository with the airflow topic, visit your repo's landing page and select "manage topics."