Skip to content

An data analysis project performed on the US video dataset from youtube.

Notifications You must be signed in to change notification settings

talesmarra/youtube_data_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Analysis on the US video database of the Youtube Dataset

This project aims to perform some data analysis in the Youtube dataset, especially considering the US videos, to try to uderstand what a video must have in order to be viewed and when it should go online, in order to maximize the number of views, and therefore, the profit of the publisher.

A similar procedure can be used to check data from other countries.

Results are presented in the notebook, as well as the general procedure and hypothesis.

Base on this, created also an Youtube Helper, a product that can simulate an assitant to a Youtuber in order to help him make his decisions, about the title and tags of the video, as well as the time of publication.

The helper can be run with streamlit:

streamlit run app.py

Author:

  • Tales Marra

Environement

In order to be able to execute the following steps, you will need to create a Python 3 Environement. This can be done by:

pip install requirements.txt

Directory Structure

youtube_data_analysis
│   README.md 
│   requirements.txt
├── data
│   │   US_category_id.json                 <- metadata
│   │   USvideos.csv                        <- csv file with data
│   .gitignore                              <- gitignore
│   Youtube_US_videos_data_analysis.ipynb   <- data analysis notebook
|   app.py                                  <- the youtuber helper

About

An data analysis project performed on the US video dataset from youtube.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published