Streaming data pipeline in AWS
Updated May 2, 2022 - Python
A data pipeline that runs ETL processes into AWS Redshift, using Spark for processing and Apache Airflow for orchestration.
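A pipeline of this shape chains three stages (extract, a Spark-based transform, and a Redshift load), with Airflow enforcing the ordering. A minimal plain-Python sketch of that flow, under stated assumptions: the stage names, the sample field `amount`, and the in-memory "warehouse" are illustrative stand-ins, not the project's actual code; in a real deployment each function would back an Airflow operator (e.g. a `PythonOperator` for extract, a `SparkSubmitOperator` for the transform, and a `COPY`-based load into Redshift).

```python
# Sketch of the extract -> transform -> load flow described above.
# All names are illustrative stand-ins for the real project's tasks.

def extract(source_rows):
    """Pull raw records from the source (stand-in for an API or S3 read)."""
    return list(source_rows)

def transform(rows):
    """Clean and reshape records (stand-in for the Spark job)."""
    return [{**r, "amount": float(r["amount"])} for r in rows]

def load(rows, warehouse):
    """Append transformed rows to the target table (stand-in for Redshift)."""
    warehouse.setdefault("fact_sales", []).extend(rows)
    return len(rows)

def run_pipeline(source_rows, warehouse):
    """Run the stages in the order Airflow would enforce."""
    return load(transform(extract(source_rows)), warehouse)
```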
Platzi - School of Amazon Web Services: Redshift for Big Data management.
The goal of this project is to build a data pipeline that gathers real-time carpark-lot availability and weather datasets from Data.gov.sg. The data are extracted via API and stored in an S3 bucket before being ingested into the data warehouse.
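The extract step here can be as small as an HTTP GET followed by an S3 put. A sketch of flattening the carpark-availability payload into rows before upload; note the field names follow my understanding of the public Data.gov.sg response shape and should be checked against the live API.

```python
# Flatten a Data.gov.sg carpark-availability response into one row per
# carpark and lot type, ready to be serialised and written to S3.
# Field names are assumptions based on the public API's documented shape.

def flatten_carpark_payload(payload):
    rows = []
    for item in payload.get("items", []):
        for carpark in item.get("carpark_data", []):
            for info in carpark.get("carpark_info", []):
                rows.append({
                    "carpark_number": carpark["carpark_number"],
                    "update_datetime": carpark["update_datetime"],
                    "lot_type": info["lot_type"],
                    "total_lots": int(info["total_lots"]),
                    "lots_available": int(info["lots_available"]),
                })
    return rows
```

In the pipeline these rows would then be serialised (e.g. as newline-delimited JSON) and uploaded with boto3's `put_object` before the Redshift ingest.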
Load data from the Million Song Dataset into AWS Redshift.
Data Warehousing in AWS with Redshift
An implementation of a data warehouse leveraging AWS Redshift. This project builds an ETL pipeline for a database hosted on AWS Redshift that extracts data from multiple JSON files residing in S3 buckets, stages it in Redshift, and transforms it into a set of dimensional tables for the analytics team to continue finding insights in…
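Staging JSON files from S3 into Redshift is typically done with the `COPY` command. A small helper that builds such a statement; the table name, S3 path, and IAM role ARN used in the example are placeholders, not values from any of these projects.

```python
def build_copy_sql(table, s3_path, iam_role, jsonpaths="auto", region="us-west-2"):
    """Build a Redshift COPY statement for JSON data staged in S3.

    `jsonpaths` may be 'auto' or the S3 URI of a JSONPaths file, per the
    Redshift COPY documentation. All argument values are caller-supplied
    placeholders here.
    """
    return (
        f"COPY {table}\n"
        f"FROM '{s3_path}'\n"
        f"IAM_ROLE '{iam_role}'\n"
        f"FORMAT AS JSON '{jsonpaths}'\n"
        f"REGION '{region}';"
    )
```

The resulting SQL would be executed against the cluster with any Postgres-compatible driver (e.g. `psycopg2`), once per staging table.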
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
ETL pipeline with AWS Redshift orchestrated with Airflow
Udacity Data Engineering Nanodegree Program - My Submission of Project: Data Pipelines
Used AWS Glue to perform ETL operations and load the resulting data into AWS Redshift. In the second phase, used AWS CloudWatch rules and AWS Lambda to run the Glue jobs automatically.
AWS Redshift Serverless cluster
TypeScript library for Redshift-specific tasks
Data pipelines created and monitored using Airflow to feed data into Redshift