Data Pipeline


Introduction: Three Experiments with Big Data

In this project, we develop a data pipeline to ingest, process, and store storm events datasets so they can be accessed through different means.

Data Explanation

SEVIR: The Storm EVent ImagRy (SEVIR) dataset is a collection of temporally and spatially aligned images containing weather events captured by satellite and radar.

The dataset contains thousands of samples of 4-hour events captured by one or more of these weather sensors. The loop below shows one such event:

[sevir_sample: animated loop of one SEVIR weather event]
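
For a first look at the data, the sketch below opens a single SEVIR HDF5 file with h5py. It is a minimal sketch assuming a locally downloaded file; the file name and the "vil" dataset key follow the public SEVIR documentation but are assumptions here, not part of this project's code.

```python
import h5py

# Open one SEVIR HDF5 file (illustrative file name; the actual files are
# downloaded from the SEVIR storage on the Registry of Open Data on AWS).
with h5py.File("SEVIR_VIL_STORMEVENTS_2018_0101_0630.h5", "r") as hf:
    vil = hf["vil"][:]  # assumed dataset key for the VIL image type
    # Expected layout per the SEVIR docs: (events, height, width, frames)
    print(vil.shape)
```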

Storm Events Database: The database currently contains data from January 1950 to November 2020, as entered by NOAA's National Weather Service (NWS). The data are available on the Registry of Open Data on AWS.
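
The yearly "details" files are gzipped CSVs, so they can be loaded directly with pandas. This is a minimal sketch assuming a local copy; the file name below is hypothetical, since NOAA appends a compile-date suffix to each file, and the column names are taken from the published details-file format.

```python
import pandas as pd

# Hypothetical local copy of one yearly Storm Events "details" file;
# NOAA publishes these as gzipped CSVs with a compile-date suffix.
df = pd.read_csv(
    "StormEvents_details-ftp_v1.0_d2020_c20210101.csv.gz",
    compression="gzip",
)
print(df[["BEGIN_DATE_TIME", "EVENT_TYPE", "STATE"]].head())
```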


Setup

  • Python 3.7+
  • Python IDE or code editor
  • Amazon S3 buckets
  • AWS Glue
  • Amazon Athena
  • Amazon QuickSight
  • Google Cloud Storage buckets
  • Google Cloud Dataflow
  • Google BigQuery
  • Google Data Studio
  • Snowflake
  • SQLAlchemy (see the sketch after this list)
  • Apache Superset
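
As one example of the access layer, the sketch below queries Snowflake through SQLAlchemy using the snowflake-sqlalchemy dialect (installed separately). The connection placeholders and the storm_events table name are assumptions for illustration, not this project's actual schema.

```python
from sqlalchemy import create_engine, text

# Connection URL format used by the snowflake-sqlalchemy dialect;
# replace the placeholders with your own account details.
engine = create_engine(
    "snowflake://<user>:<password>@<account>/<database>/<schema>?warehouse=<warehouse>"
)

with engine.connect() as conn:
    # storm_events is a hypothetical table name used for illustration.
    result = conn.execute(text(
        "SELECT event_type, COUNT(*) AS n FROM storm_events GROUP BY event_type"
    ))
    for event_type, n in result:
        print(event_type, n)
```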

Clone

Clone this repo to your local machine with git clone https://github.com/goyal07nidhi/Data-Pipeline.git

Folder Contents

Refer to the README.md inside the respective directories for setup instructions.

  • ✅ AWS S3: AWS directory
  • ✅ GCP - Dataflow, Datalab: GCP directory
  • ✅ Snowflake: SNOWFLAKE directory

Team Members:

  1. Nidhi Goyal
  2. Kanika Damodarsingh Negi
  3. Rishvita Reddy Bhumireddy