- Use the requests library to pull data from the API every minute
- Ship the stream of data to S3 via Kinesis using the boto3 library (see the ingestion sketch after this list)
- Run the above steps on an EC2 instance non-stop
- Spin up an EMR cluster
- Use Spark to read all the data from S3
- Generate 3NF-compliant tables and save them as Parquet files in S3
- Use Spark to read the Parquet files and create temp views
- Use Spark SQL to select data from those views into a Spark DataFrame
- Convert that Spark DataFrame into a pandas DataFrame (see the Spark sketch after this list)
- Perform machine learning on that DataFrame, generate a plot, and save it to S3 (see the plotting sketch after this list)
- Shut down the cluster
- Repeat the above steps every day with a cron job on an EC2 instance so new plots are generated daily from updated data
- Set up a Flask app that references the plot saved in S3 (see the Flask sketch after this list)
- Run this app on an EC2 instance non-stop
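
Ingestion sketch (steps 1-3): a minimal loop that polls an API with requests and pushes each response onto a Kinesis stream with boto3. The endpoint URL, stream name, and partition key below are placeholders, and delivery from the stream into S3 is assumed to be handled downstream (for example by a Kinesis Data Firehose delivery stream).

```python
import json
import time

import boto3
import requests

API_URL = "https://api.example.com/data"   # hypothetical API endpoint
STREAM_NAME = "my-data-stream"             # hypothetical Kinesis stream name

kinesis = boto3.client("kinesis")

while True:
    # Pull the latest data from the API once a minute
    response = requests.get(API_URL, timeout=10)
    response.raise_for_status()

    # Push the raw JSON payload onto the Kinesis stream; landing the
    # records in S3 is assumed to happen downstream (e.g. via Firehose)
    kinesis.put_record(
        StreamName=STREAM_NAME,
        Data=json.dumps(response.json()).encode("utf-8"),
        PartitionKey="api-poll",
    )
    time.sleep(60)
```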
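
Spark sketch (steps 5-9): reading the raw data from S3, writing a normalized table back as Parquet, registering a temp view, querying it with Spark SQL, and converting the result to pandas. Bucket names, paths, and column names are illustrative only, and the normalization shown is a stand-in for the real 3NF modeling.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("daily-batch").getOrCreate()

# Read the raw records that Kinesis landed in S3 (JSON assumed here)
raw = spark.read.json("s3://my-raw-bucket/data/")

# Normalize into a 3NF-style table and persist it as Parquet
# (placeholder columns; the real job would produce several tables)
raw.select("id", "name").dropDuplicates() \
    .write.mode("overwrite").parquet("s3://my-clean-bucket/entities/")

# Read the Parquet table back and expose it as a temp view
entities = spark.read.parquet("s3://my-clean-bucket/entities/")
entities.createOrReplaceTempView("entities")

# Query the view with Spark SQL, then convert the result to pandas
result_df = spark.sql("SELECT id, name FROM entities")
pandas_df = result_df.toPandas()
```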
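
Plotting sketch (step 10): fitting a simple model to the pandas DataFrame from the previous step, drawing a plot, and uploading it to S3. The scikit-learn model, column names, bucket, and object key are all assumptions for illustration, not the project's actual choices.

```python
import boto3
import matplotlib
matplotlib.use("Agg")  # headless rendering on EMR/EC2
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.linear_model import LinearRegression

# pandas_df would normally come from the Spark step; tiny placeholder here
pandas_df = pd.DataFrame({"x": [1, 2, 3, 4], "y": [2.1, 3.9, 6.2, 8.1]})

# Fit a simple regression as a stand-in for the real ML step
X = pandas_df[["x"]].values
y = pandas_df["y"].values
model = LinearRegression().fit(X, y)

# Plot the data and the fitted line, then save locally
fig, ax = plt.subplots()
ax.scatter(X, y, label="data")
ax.plot(X, model.predict(X), label="fit")
ax.legend()
fig.savefig("/tmp/plot.png")

# Upload the plot to S3 so the Flask app can reference it
boto3.client("s3").upload_file("/tmp/plot.png", "my-plot-bucket", "plots/plot.png")
```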
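
Flask sketch (steps 13-14): a minimal app that renders a page pointing at the plot stored in S3. The bucket, key, and port are placeholders, and serving the image through a presigned URL is one possible approach rather than the project's confirmed design.

```python
import boto3
from flask import Flask

app = Flask(__name__)

S3_BUCKET = "my-plot-bucket"   # hypothetical bucket
S3_KEY = "plots/plot.png"      # hypothetical object key

@app.route("/")
def dashboard():
    # Generate a short-lived presigned URL so the browser can load the plot
    url = boto3.client("s3").generate_presigned_url(
        "get_object",
        Params={"Bucket": S3_BUCKET, "Key": S3_KEY},
        ExpiresIn=3600,
    )
    return f'<h1>Daily plot</h1><img src="{url}" alt="daily plot">'

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```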