ETL

General Info

This project is a work in progress. Its aim is to build a data pipeline that transforms very large data sets and loads them into databases.
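
As a rough illustration of the transform-and-load idea, here is a minimal sketch that is not taken from this repository. It assumes a CSV source and a SQLite target purely for demonstration. The `records` table, the `id`/`name`/`amount` columns, and the `iter_chunks`, `transform`, and `load_csv` helpers are all hypothetical names. Rows are streamed in fixed-size chunks so the full data set never has to fit in memory.

```python
import csv
import io
import sqlite3
from itertools import islice


def iter_chunks(rows, size):
    """Yield lists of at most `size` rows so the full data set never sits in memory."""
    rows = iter(rows)
    while True:
        chunk = list(islice(rows, size))
        if not chunk:
            return
        yield chunk


def transform(row):
    """Hypothetical cleaning step: trim whitespace and cast the amount to float."""
    return (row["id"].strip(), row["name"].strip().title(), float(row["amount"]))


def load_csv(source, conn, chunk_size=1000):
    """Stream rows from `source`, transform them, and insert them chunk by chunk."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS records (id TEXT PRIMARY KEY, name TEXT, amount REAL)"
    )
    total = 0
    for chunk in iter_chunks(csv.DictReader(source), chunk_size):
        with conn:  # commit each chunk as its own transaction
            conn.executemany(
                "INSERT INTO records VALUES (?, ?, ?)",
                [transform(row) for row in chunk],
            )
        total += len(chunk)
    return total


# Tiny in-memory stand-in for a large file (assumed sample data).
demo = io.StringIO("id,name,amount\n1, alice ,10.5\n2,bob,3\n3, carol,7.25\n")
conn = sqlite3.connect(":memory:")
loaded = load_csv(demo, conn, chunk_size=2)
```

Committing per chunk keeps each transaction small. A real pipeline over very large data would likely tune `chunk_size` and swap SQLite for the actual target database.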

Technology

Python
Bash scripts

Setup

Coming soon :)

Status

The repository layout is still being organized. The pipeline currently uses a demo data_lake from a case study.
