
rmihir96/Exploratory-Analysis-in-R

Exploratory-Analysis-in-R

Replicate the CDC flu data analysis using Twitter data in R

Problem Definition:

The task is to replicate the US flu heatmap published by the CDC (www.cdc.gov/flu/weekly/) by mining flu-related tweets through the Twitter API. The heatmap is plotted from the geolocations of the mined tweets.

Pre-requisites

Programming Language: R

Packages used: rtweet, ggmap, ggplot2

Analysis

Around 100,000,00 tweets were mined using keywords such as "flu", "#flu", and "influenza". For tweets without a location, one was added using "geocode" from ggmap, which queries Google's geocoding API. The US heatmap generated from the tweets does not closely match the one on the CDC website.
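The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the code from this repository: the helper names (`build_query`, `fetch_flu_tweets`, `count_by_state`, `plot_state_heatmap`) and the `state` column are assumptions made for the example. The fetch step assumes rtweet is installed and Twitter API credentials are configured, and the plotting step assumes the ggplot2 and maps packages are available. Neither is called in the demo at the end.

```r
# Keywords taken from the examples named in the Analysis section.
flu_keywords <- c("flu", "#flu", "influenza")

# Join the keywords into a single search query; Twitter's search
# syntax treats OR as "match any of these terms".
build_query <- function(keywords) {
  paste(keywords, collapse = " OR ")
}

# Hypothetical fetch step (not called below): requires the rtweet
# package and valid Twitter API credentials.
fetch_flu_tweets <- function(n = 1000) {
  rtweet::search_tweets(build_query(flu_keywords), n = n, include_rts = FALSE)
}

# Count tweets per state. `tweets` is assumed to be a data frame with a
# `state` column of lower-case state names, e.g. derived from geocoding.
count_by_state <- function(tweets) {
  counts <- as.data.frame(table(state = tweets$state), stringsAsFactors = FALSE)
  names(counts) <- c("state", "n_tweets")
  counts[order(-counts$n_tweets), ]
}

# Hypothetical plotting step (not called below): fills each state
# polygon by its tweet count. Requires ggplot2 and maps.
plot_state_heatmap <- function(counts) {
  states <- ggplot2::map_data("state")
  merged <- merge(states, counts, by.x = "region", by.y = "state", all.x = TRUE)
  merged <- merged[order(merged$order), ]  # restore polygon drawing order
  ggplot2::ggplot(
    merged,
    ggplot2::aes(x = long, y = lat, group = group, fill = n_tweets)
  ) +
    ggplot2::geom_polygon(colour = "white") +
    ggplot2::coord_quickmap()
}

# Small offline demo with made-up rows, only to show the aggregation.
demo_tweets <- data.frame(
  state = c("texas", "ohio", "texas"),
  stringsAsFactors = FALSE
)
demo_counts <- count_by_state(demo_tweets)
print(demo_counts)
```

Raw tweet counts per state mostly track population and Twitter usage, which may be one reason a map like this differs from the CDC's; dividing by state population before plotting would be a natural next step.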

The results can be seen in the deployed Shiny app.
