Skip to content

SEPHIRONOVA/Data_Engineering_Projects

Repository files navigation

Udacity Data Engineering Nano Degree

This repository contains the project completed for Udacity Data Engineering Nano Degree

Project 1 - Data Modeling with Postgres

Created ETL pipline to load data into star schema

Link

Project 2 - Data Modelingwith Apache Cassandra

Modeling data with Apache Cassandra to satisfy specific analytics query requirement

Link

Project 3 - Data Warehouse

Design the destination table and loaded data from AWS S3 to AWS Redshift

Link

Project 4 - Data Lake

Built the ETL process on cloud with Spark and load back to AWS S3

Link

Project 5 - Data Pipelines

Construct data pipelines with Airflow by loading data from AWS S3 to AWS Redshift

Link

Capstone Project - ETF Research Data Pipeline

Created a data pipeline for index in different geographical location and different sectors. It allows for easily accessible index data for ETF research purpose.

Link

Releases

No releases published

Packages

No packages published