Skip to content

deema-A/Song-Play-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Song Play Analysis

This project is a data analysis for songs dataset pertaining a startup called Sparkify. The analytics particularly concern the songs in addition to the user activity on Sparkify new music streaming app. Sparkify aims to understand what songs users are listening to. To fulfill this purpose their data needs to be modeled in order to easily querying the data.

In this project:

  • Converting raw data into well structured data warehouse.
  • Creating a database schema of Facts and Dimensions tables.
  • Building the ETL pipelines.

Prerequisites:

  • Python3

How to run it:

  • Run create_tables.py.
  • Run etl.py.
  • Start tweaking etl.ipynd if you like.

The project contains:

  • Test.ipynb > to check the update on the warehouse.
  • sql_queries.py > contians basic (Drop tables, Create tables, Insert and Select) quieries.
  • etl.py > etl pipelines.
  • etl.ipynb > nontebook for the same goal as etl.py except it gives a closer look to the data and not considering the whole process.
  • Create_tables.py > where connections to database and queries execution take place.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published