udacity

Real-world data rarely comes clean. Using Python and its libraries, you will gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, then clean it. This is called data wrangling. You will document your wrangling efforts in a Jupyter Notebook, plus showcase them through analyses and visualizations using Python (and its libraries) and/or SQL.

The dataset that you will be wrangling (and analyzing and visualizing) is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage.

WeRateDogs downloaded their Twitter archive and sent it to Udacity via email exclusively for you to use in this project. This archive contains basic tweet data (tweet ID, timestamp, text, etc.) for all 5000+ of their tweets as they stood on August 1, 2017.

Your goal: wrangle WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations. The Twitter archive is great, but it only contains very basic tweet information. Additional gathering, then assessing and cleaning is required for "Wow!"-worthy analyses and visualizations.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
act_report.pdf		act_report.pdf
df_mg.csv		df_mg.csv
df_tw_master.csv		df_tw_master.csv
dogbar.png		dogbar.png
image-predictions.tsv		image-predictions.tsv
image_predictions.tsv		image_predictions.tsv
insightfour.png		insightfour.png
insightthree.png		insightthree.png
output.png		output.png
output2.png		output2.png
output3.png		output3.png
tweet_json.txt		tweet_json.txt
twitter-archive-enhanced.csv		twitter-archive-enhanced.csv
twitter_archive_master.csv		twitter_archive_master.csv
wrangle_act.ipynb		wrangle_act.ipynb
wrangle_report.pdf		wrangle_report.pdf

MSjoia/project-4_udacity_Wrangle-and-Analye-Data

Folders and files

Latest commit

History

Repository files navigation

udacity

About

Topics

Resources

Stars

Watchers

Forks

Languages