
Dorianteffo/etl_pipeline_docker_metabase


Overview

Architecture

Tech Stack

  • Docker
  • Python
  • Metabase
  • PostgreSQL
  • GitHub Actions (CI/CD)

Project Overview

In this project, we first use Python and SQLAlchemy to load a CSV file containing sales and movement data by item and month into a Postgres schema called "landing_area" (see pipeline/raw_data_to_landing.py). We then apply transformation logic to that table with Pandas to build a star schema, and load the result into the "staging_area" schema for visualization in Metabase.
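The load step might look roughly like the sketch below. The connection string, CSV path, and the `landing_area.sales_raw` table name are illustrative assumptions, not values taken from the repo:

```python
import pandas as pd
from sqlalchemy import create_engine

# Assumption: real connection details live in the repo's Docker config.
ENGINE = create_engine("postgresql://postgres:postgres@localhost:5432/warehouse")

def raw_data_to_landing(csv_path: str = "data/sales.csv") -> None:
    """Load the raw sales/movement CSV into the Postgres landing area."""
    df = pd.read_csv(csv_path)
    # Write the untouched rows into the landing_area schema;
    # if_exists="replace" keeps the pipeline idempotent across reruns.
    df.to_sql("sales_raw", ENGINE, schema="landing_area",
              if_exists="replace", index=False)

if __name__ == "__main__":
    raw_data_to_landing()
```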

Run the pipeline

Here are the commands to set up the environment:

  • make up: Create and run all the containers.
  • make ci: Format the code and run the tests.
  • make etl: Run the pipeline.
  • make warehouse: Connect to the Postgres database and check the data (a Python alternative is sketched after this list).
  • Go to localhost:3000 to open Metabase.
  • make down: Stop the containers.
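The data can also be sanity-checked directly from Python instead of through make warehouse. A minimal sketch, assuming the same illustrative connection string as above and the "staging_area" schema named in the overview:

```python
from sqlalchemy import create_engine, text

# Assumption: connection details should match the Docker setup.
engine = create_engine("postgresql://postgres:postgres@localhost:5432/warehouse")

with engine.connect() as conn:
    # List every table the pipeline created in the staging area.
    tables = conn.execute(text(
        "SELECT table_name FROM information_schema.tables "
        "WHERE table_schema = 'staging_area'"
    )).scalars().all()
    # Print a row count per table as a quick sanity check.
    for table in tables:
        count = conn.execute(
            text(f'SELECT COUNT(*) FROM staging_area."{table}"')
        ).scalar()
        print(f"staging_area.{table}: {count} rows")
```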

Data model

data_model.png
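A minimal sketch of how the transform step could derive a star schema like this with Pandas; the column names (item, month, sales, movement) and target tables (dim_item, fact_sales) are assumptions for illustration, not the repo's actual model:

```python
import pandas as pd
from sqlalchemy import create_engine

# Assumption: same illustrative connection string as the load sketch.
ENGINE = create_engine("postgresql://postgres:postgres@localhost:5432/warehouse")

def landing_to_staging() -> None:
    """Reshape the landing table into a star schema in staging_area."""
    raw = pd.read_sql_table("sales_raw", ENGINE, schema="landing_area")

    # One dimension row per distinct item, with a surrogate key.
    dim_item = (raw[["item"]].drop_duplicates()
                .reset_index(drop=True)
                .rename_axis("item_id").reset_index())

    # Fact table: one row per item/month with its measures,
    # referencing the dimension through the surrogate key.
    fact_sales = (raw.merge(dim_item, on="item")
                     [["item_id", "month", "sales", "movement"]])

    dim_item.to_sql("dim_item", ENGINE, schema="staging_area",
                    if_exists="replace", index=False)
    fact_sales.to_sql("fact_sales", ENGINE, schema="staging_area",
                      if_exists="replace", index=False)
```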

Dashboard

dashboard1.PNG, dashboard2.PNG (Metabase dashboard screenshots)

About

Data pipeline to build a data warehouse on Postgres
