Twitter Sentiment Analysis

By:

Vishnu Nair (@NairVish)
Keith Low (@keithlowc)

A series of scripts that downloads tweets for a list of cities (in this case, northeastern U.S. cities) using a particular query, cleans them, computes an average sentiment for each city's tweets, and plots these average sentiment scores on a bubble map of the northeastern U.S.

Screenshots

The following sentiment maps were created using tweets pulled during the evening of October 6, 2018 using the query "trump."

Metro NYC area

Northeast U.S. (PA, NJ, NY, CT, RI, MA, VT, NH, ME)

Interactive Demo

The actual heatmap output for the above query can be found here.

Structure

shp_to_csv.py: Forms the list of cities to get tweets for by parsing shapefile data from the USGS and Gazetteer data from the U.S. Census Bureau. The output is a CSV file of all of the cities and their respective states, lat/lons, and size.
- Specifically, we use the ESRI northeast cities shapefiles for the Long Island Sound ArcView project area from the U.S. Geological Survey to get a list of cities in the Northeast, and the 2010 Census Gazetteer Files from the U.S. Census Bureau (specifically places data for each of the states in question) to get each city's size.
- From the above processing, we attempt to get data for a total of 606 cities/towns/etc.
grab_data.py: Connects to the Twitter API (standard tier) to grab tweets for each specified city.
- keys.py holds the appropriate API keys.
- To allow for quick processing, we only download up to 100 tweets. (Though, of course, more will be useful.)
json_parser.py: Cleans the tweet data (links, some special characters, etc.) and removes duplicates.
analyze_data.py: Computes sentiments on the cleaned tweet data (using VADER Sentiment Analysis), plots them on a bubble map (using Folium), and saves the resultant map in an html file.
main.py: A simple script that executes almost all of the above in one go.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
demo		demo
tweets		tweets
usgs_ne_cities_shapefile		usgs_ne_cities_shapefile
.gitignore		.gitignore
README.md		README.md
analyze_data.py		analyze_data.py
cities.csv		cities.csv
grab_data.py		grab_data.py
json_parser.py		json_parser.py
keys.py		keys.py
main.py		main.py
requirements.txt		requirements.txt
shp_to_csv.py		shp_to_csv.py
us_census_gazetteer_ne.txt		us_census_gazetteer_ne.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

demo

demo

tweets

tweets

usgs_ne_cities_shapefile

usgs_ne_cities_shapefile

.gitignore

.gitignore

README.md

README.md

analyze_data.py

analyze_data.py

cities.csv

cities.csv

grab_data.py

grab_data.py

json_parser.py

json_parser.py

keys.py

keys.py

main.py

main.py

requirements.txt

requirements.txt

shp_to_csv.py

shp_to_csv.py

us_census_gazetteer_ne.txt

us_census_gazetteer_ne.txt

Repository files navigation

Twitter Sentiment Analysis

Screenshots

Interactive Demo

Structure

About

Releases

Packages

Languages

NairVish/tweet-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

Twitter Sentiment Analysis

Screenshots

Interactive Demo

Structure

About

Resources

Stars

Watchers

Forks

Languages