Skip to content

BinariesGoalls/Udacity-Data-Engineering-Nanodegree

Repository files navigation


Data Engineering Nanodegree

Udacity Nanodegree
Explore the repository»

Certificate of Completion

About The Nanodegree

You can check more details about the nanodegree program on this link: Data Engineering Nanodegree

Program Details

During this program, I will learn to design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets. At the end of the program, I’ll combine my new skills by completing a capstone project.

Course 1 – Data Modeling

In this course, I’ll learn to create relational and NoSQL data models to fit the diverse needs of data consumers. I’ll understand the differences between different data models, and how to choose the appropriate data model for a given situation. I’ll also build fluency in PostgreSQL and Apache Cassandra.

You can find my demos and exercices related to this course here.

The first project can be found here Project 1: Data Modeling with PostgreSQL.

The second project can be found here Project 2: Data Modeling with Apache Cassandra.

Course 2 – Cloud Data Warehouses

In this course, I’ll learn to create cloud-based data warehouses. I’ll sharpen my data warehousing skills, deepen my understanding of data infrastructure, and be introduced to data engineering on the cloud using Amazon Web Services (AWS).

You can find my demos and exercices related to this course here.

This course project can be found here Project 3: Data Warehouse.

Course 3 – Spark and Data Lakes

In this course, i'll learn about the big data ecosystem and how to use Spark to work with massive datasets. I'll also learn about how to store big data in a data lake and query it with Spark.

You can find my demos and exercices related to this course here.

This course project can be found here Project 4: Data Lake.

Course 4 – Automate Data Pipelines

In this course, I’ll learn how to schedule, automate, and monitor data pipelines using Apache Airflow. I’ll learn to run data quality checks, track data lineage, and work with data pipelines in production.

You can find my demos and exercices related to this course here.

This course project can be found here Project 5: Data Pipelines with Airflow.

Capstone Project

In this module I have combined my new skills by completing a capstone project.

The project can be found here Capstone Project: ETL Pipeline for a Brazillian E-Commerce.

Contact

Alisson lima - ali2slima10@gmail.com

Project Link: https://github.com/BinariesGoalls/Data-Engineering-Nanodegree

Linkedin: https://www.linkedin.com/in/binariesgoalls/

Releases

No releases published

Packages

No packages published