Personal Data Engineering Projects
Updated Feb 8, 2023 - Jupyter Notebook
Scans databases and data warehouses for PII and tags the affected tables and columns in data catalogs such as Amundsen and DataHub.
Redshift Python Connector, supporting the Python Database API Specification v2.0 (PEP 249).
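Because the connector follows DB-API v2.0, it exposes the familiar connect() → cursor() → execute() → fetch*() pattern. A minimal sketch of that pattern, shown with the stdlib sqlite3 driver since a live Redshift cluster isn't assumed here; with the Redshift connector you would pass host, database, user, and password to connect() instead of a file path:

```python
# DB-API v2.0 (PEP 249) usage pattern: connect -> cursor -> execute -> fetch.
# sqlite3 stands in for the Redshift connector; table and values are
# illustrative placeholders, not from the original project.
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE rides (id INTEGER, fare REAL)")
cur.executemany("INSERT INTO rides VALUES (?, ?)", [(1, 12.5), (2, 7.0)])
cur.execute("SELECT COUNT(*), SUM(fare) FROM rides")
count, total = cur.fetchone()
conn.close()
```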
Udacity Data Engineering Nanodegree Program
Developed a data pipeline to automate data warehouse ETL by building custom Airflow operators that handle the extraction, transformation, validation, and loading of data from S3 -> Redshift -> S3.
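Custom operators for S3-to-Redshift loads typically render and execute Redshift's COPY command against the cluster. A minimal sketch of the SQL such an operator might build; the schema, table, bucket, and IAM role names below are hypothetical placeholders, not from the original project:

```python
# Sketch of the COPY statement a custom S3-to-Redshift operator might
# render; all identifiers (schema, table, bucket, IAM role) are
# hypothetical placeholders.
def build_copy_sql(schema: str, table: str, s3_path: str, iam_role: str) -> str:
    return (
        f"COPY {schema}.{table}\n"
        f"FROM '{s3_path}'\n"
        f"IAM_ROLE '{iam_role}'\n"
        "FORMAT AS JSON 'auto';"
    )

sql = build_copy_sql(
    "staging",
    "events",
    "s3://example-bucket/events/2023/02/",
    "arn:aws:iam::123456789012:role/redshift-load",  # hypothetical role ARN
)
```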
Tracks Uber Rides and Uber Eats expenses through data engineering processes, using technologies such as Apache Airflow, AWS Redshift, and Power BI.
A data pipeline performing ETL into AWS Redshift using Spark, orchestrated with Apache Airflow.
Build clickstream analytics on AWS for your mobile and web applications
An example system that captures a large stream of product usage data (events) and provides both real-time data visualization and SQL-based data analytics.
Project 3 - Data Engineering Nanodegree
Sample Spring Boot Data JPA integration with AWS Redshift.
Example project consuming an AWS Kinesis stream and saving the data to Amazon Redshift using Apache Spark.
A batch-processing data pipeline using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance (currently powered down).
A simple command-line tool to copy tables from Amazon Redshift to Amazon RDS (PostgreSQL).
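One way such a copy tool can work is to stream rows between two DB-API connections in batches; production tools often UNLOAD to S3 and bulk-load into Postgres instead. A sketch of the row-streaming approach, using two in-memory sqlite3 databases as stand-ins for Redshift and RDS since neither service is assumed here:

```python
# Stream rows from one DB-API connection to another in batches.
# sqlite3 stands in for both Redshift (source) and RDS Postgres (target);
# the table name and data are illustrative placeholders.
import sqlite3

def copy_table(src, dst, table: str, batch_size: int = 1000) -> int:
    src_cur = src.cursor()
    src_cur.execute(f"SELECT * FROM {table}")
    placeholders = ", ".join("?" * len(src_cur.description))
    copied = 0
    while True:
        rows = src_cur.fetchmany(batch_size)
        if not rows:
            break
        dst.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)
        copied += len(rows)
    dst.commit()
    return copied

src = sqlite3.connect(":memory:")
dst = sqlite3.connect(":memory:")
for db in (src, dst):
    db.execute("CREATE TABLE users (id INTEGER, name TEXT)")
src.executemany("INSERT INTO users VALUES (?, ?)", [(1, "a"), (2, "b")])
n = copy_table(src, dst, "users")
```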
Project 5 - Data Engineering Nanodegree
A project motivated by an interest in data engineering and ETL pipelines, and a good opportunity to develop skills with a range of tools. As such, it is more complex than strictly required, utilising dbt, Airflow, Docker, and cloud-based storage.
Completed Udacity's data engineering nanodegree: a series of exercises and projects to learn and practice popular big data management tools.
Configuring Redshift cluster using Terraform.
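A minimal sketch of how such a cluster might be declared with the Terraform AWS provider's aws_redshift_cluster resource; every identifier and value below is a placeholder assumption, not taken from the original repo:

```hcl
# Placeholder Redshift cluster definition; names, sizes, and credentials
# are illustrative, not from the original project.
resource "aws_redshift_cluster" "example" {
  cluster_identifier = "example-cluster"
  database_name      = "analytics"
  master_username    = "adminuser"
  master_password    = var.redshift_password # supplied securely, e.g. via TF_VAR
  node_type          = "dc2.large"
  cluster_type       = "single-node"
}

variable "redshift_password" {
  type      = string
  sensitive = true
}
```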
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via GitHub Actions.