Skip to content

mikekenneth/airflow_minio_twitter_data_pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Twitter Data Pipeline with apache-airflow & MinIO

Table of Contents

About

A simple example of Data Pipeline using apache-airflow (Orchestrator) and MinIO(Object Storage like s3).

Getting Started

Below is a design of the project. (The drawio file can be found in docs/architecture.drawio: Twitter Data Pipeline - Architecture

Prerequisites

Usage

  1. We need to create and .env from sample.env:
cp sample.env .env
  1. Add the Twitter Bearer Token in the .env file as below:
TWITTER_BEARER_TOKEN="vNVxBVjj-0yhF!Ipc-p7Nrzl7C2wISOI6BLXVk087/jJS4auIp0SKSXI/7npGy1kl7xDXxRuJ55Lor5FHI!6!!a5v0!IrxCDYQDEgMBQzOZivgIEpQJsvC4A0nqFbqxA"
  1. We can simply run the pipeline using docker-compose.

To start

docker compose up -d

To shutdown

docker compose down
  1. Then we can connect the below respectively:

Resources

License

This project is licensed under the MIT License - see the LICENSE file for details.

Credits

See as you fit.

Contact

If you have any questions or would like to get in touch, you can email: mike.kenneth47@gmail.com OR twitter

Releases

No releases published

Packages

No packages published