Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
-
Updated
Sep 17, 2018
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
Examples that I use to learn and show Apache Beam
A data engineering pipeline for digital marketers.
ETL pipeline combined with supervised learning and grid search to classify text messages sent during a disaster event
ETL Pipeline / ML Pipeline of Disaster Data provided by figure8
Disaster Response Pipeline | Data Engineering
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A quick implementation of OCR Application with AWS Lambda.
Data Engineering Projects including Data Modeling, Data Warehouse, Data Lake Development
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Data pipeline with Apache Airflow - Data Engineering Nanodegree (DEND) 5th Project
ETL pipeline for construction permits data in Los Angeles built on AWS S3, Lambda and RDS PostgreSQL.
Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DynamoDB as the database
Takes product reviews and performs natural language processing to provide sentiment analysis. The new insight gets combined with matching product information in the central database to provide a clearer picture of user behavior.
This is an ETL project - extracting data from an ecommerce transactional database on RDS, transforming the data using AWS glue job, and loading it to a Redshift data warehouse, and connected it to Tableau for BI
This repository is the collection point for all of the projects completed during the Udacity Data Engineering Nano Degree program.
Projects and Exercises for Udacity Data Engineering Nano Degree
Building Machine Learning and ETL Pipelines to categorize emergency messages based on the needs communicated by the sender
A streaming ETL pipeline for Realtime Tweet Collection, Analysis and Reporting
Add a description, image, and links to the data-engineering-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering-pipeline topic, visit your repo's landing page and select "manage topics."