ermiasgelaye/Google-Health-Search-Project

Google Health Search Trends by US Metropolitan Area, 2004 – 2017

Project Goal

Google Trends data allows us to see what people are searching for at a very local level. The goal of this project is to track and visualize the top searches for common health issues in the United States, from cancer to diabetes, and to compare them with where those conditions actually occur, in order to understand how search data reflects life for millions of Americans.

Research Question

How have the most searched diseases changed over the years in the US?

Data sources

Data Architecture Diagram

ETL

Extract

The data was sourced from Google Trends, specifically Google health search data from 2004 to 2017.

Transform

The data was cleaned and transformed in a Python Jupyter Notebook (Health_Analysis.ipynb).
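As an illustration of the kind of reshaping a transform step like this might involve, here is a minimal sketch that turns wide rows (one column per year and condition) into long records. It is not taken from Health_Analysis.ipynb: the `year+condition` column naming, the `dma` region column, the function name `wide_to_long`, and the sample values are all assumptions made up for this example.

```python
def wide_to_long(rows):
    """Reshape wide rows (one column per year+condition) into long records.

    Assumes hypothetical column names such as "2004+cancer" and a "dma"
    column naming the metropolitan area.
    """
    records = []
    for row in rows:
        for column, value in row.items():
            # Only columns shaped like "<year>+<condition>" carry search volumes.
            if "+" not in column:
                continue
            year, condition = column.split("+", 1)
            # Skip malformed headers and missing values instead of guessing.
            if not year.isdigit() or value in ("", None):
                continue
            records.append({
                "dma": row["dma"].strip(),
                "year": int(year),
                "condition": condition.strip().lower(),
                "volume": int(value),
            })
    return records


# Made-up sample row; real rows would come from csv.DictReader or similar.
sample = [
    {"dma": " Metro A ", "2004+cancer": "44", "2004+diabetes": "", "2005+Cancer": "48"},
]
long_rows = wide_to_long(sample)
```

The long format (one row per region, year, and condition) tends to be easier to load into a relational table and to group by year or condition later on.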

Load

  • A Python Jupyter Notebook (loadData.ipynb) was used to load the transformed data into a PostgreSQL database.

  • A Python Flask–powered RESTful API was used to serve the data on the web, and links to the API endpoints were created.
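The load-and-serve steps above can be sketched roughly as follows. To keep the example self-contained it uses Python's built-in `sqlite3` module as a stand-in for PostgreSQL and returns a JSON string instead of running a Flask server. The table name `health_search`, its columns, the function names, and the sample rows are hypothetical and are not taken from loadData.ipynb or the project's API.

```python
import json
import sqlite3


def load_records(conn, records):
    """Create the (hypothetical) health_search table and insert long-format rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS health_search ("
        "dma TEXT, year INTEGER, condition TEXT, volume INTEGER)"
    )
    # Named placeholders keep values out of the SQL string itself.
    conn.executemany(
        "INSERT INTO health_search (dma, year, condition, volume) "
        "VALUES (:dma, :year, :condition, :volume)",
        records,
    )
    conn.commit()


def volume_by_year(conn, condition):
    """Return yearly totals for one condition in a JSON-friendly shape."""
    cursor = conn.execute(
        "SELECT year, SUM(volume) FROM health_search "
        "WHERE condition = ? GROUP BY year ORDER BY year",
        (condition,),
    )
    return [{"year": year, "volume": total} for year, total in cursor]


# In-memory database with made-up rows, standing in for the PostgreSQL instance.
conn = sqlite3.connect(":memory:")
load_records(conn, [
    {"dma": "Metro A", "year": 2004, "condition": "cancer", "volume": 40},
    {"dma": "Metro B", "year": 2004, "condition": "cancer", "volume": 50},
    {"dma": "Metro A", "year": 2005, "condition": "cancer", "volume": 45},
])
payload = json.dumps(volume_by_year(conn, "cancer"))
```

In a Flask app, a route handler could return the result of a query function like `volume_by_year` through `jsonify`. With a PostgreSQL driver such as psycopg2, the placeholders would be written as `%s` or `%(name)s` rather than `?` or `:name`.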

Deployment

The app is deployed on Heroku. To access the page, follow this link: Eagle Dashboard

Data Analysis and Visualization

The following visualizations were made:

Health Search Volume by State and Region (Choropleth map)

Interactive Charts With Dropdown Selection "City"

Health Search Volume by Year (Single Line Chart)

Health Search Volume by Year and Condition (Multiple Line Chart)

Health Search Volume by State

Correlation Between Health Conditions

Boxplot of Google Health Searches, 2004-2017

Radar Plot on All Time Total Volume of Health Searches

Radar Plot on the Sum Total Volume of 10 Leading Causes of Death Per 100,000 Population from 2004-2017
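Several of the charts listed above rest on per-year aggregates and pairwise correlations between conditions. The sketch below shows one way such numbers could be computed from long-format records using only the standard library. The record fields, function names, and sample values are assumptions for illustration and do not reproduce the project's notebooks.

```python
from collections import defaultdict
from math import sqrt


def yearly_totals(records):
    """Sum search volume per condition and year: totals[condition][year]."""
    totals = defaultdict(lambda: defaultdict(int))
    for record in records:
        totals[record["condition"]][record["year"]] += record["volume"]
    return totals


def pearson(xs, ys):
    """Pearson correlation of two equal-length series.

    Raises ZeroDivisionError if either series is constant.
    """
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def condition_correlation(records, first, second):
    """Correlate two conditions over the years in which both have data."""
    totals = yearly_totals(records)
    years = sorted(set(totals[first]) & set(totals[second]))
    return pearson(
        [totals[first][year] for year in years],
        [totals[second][year] for year in years],
    )


# Made-up records: both conditions rise together across three years.
records = [
    {"condition": "cancer", "year": 2004, "volume": 10},
    {"condition": "cancer", "year": 2005, "volume": 20},
    {"condition": "cancer", "year": 2006, "volume": 30},
    {"condition": "diabetes", "year": 2004, "volume": 5},
    {"condition": "diabetes", "year": 2005, "volume": 10},
    {"condition": "diabetes", "year": 2006, "volume": 15},
]
r = condition_correlation(records, "cancer", "diabetes")
```

The per-condition totals from `yearly_totals` are the shape a multiple line chart by year and condition would need, while the pairwise coefficients would fill a correlation matrix.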

Team members (Team Eagle)

  • Adedamola Atekoja (‘Damola)
  • Amanda Qianyue Ma
  • Amos Johnson
  • Ermias Gaga
  • Maria Lorena
