Elmehdi9/ETL-Project-

Movies-ETL

Project Overview

Overview

In this project I will create an automated pipeline that takes in scraped data from Wikipedia and IMDB, then transforms it and loads it into an existing PostgreSQL database.

ETL Image

Workflow

  1. Read three data files (IMDB, Wikipedia, Ratings).
  2. Extract and Transform data.
  3. Load data to a PostgreSQL Movie Database.
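
The extract and transform steps above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the notebook's actual code: the in-memory sample data, the column names (`imdb_id`, `budget`, `runtime`, `rating`), the `imdb_link` field, and the `extract` and `transform` helpers are all assumptions made for the example.

```python
import csv
import io
import json
import re

# Hypothetical in-memory samples standing in for the three input files.
WIKI_JSON = json.dumps([
    {"title": "Example Movie", "year": 1999,
     "imdb_link": "https://www.imdb.com/title/tt0000001/"},
    {"title": "No Link Movie", "year": 2001},
])
METADATA_CSV = "imdb_id,budget,runtime\ntt0000001,1000000,95\n"
RATINGS_CSV = "imdb_id,rating\ntt0000001,4.0\ntt0000001,3.0\n"

# IMDB title identifiers look like "tt" followed by at least seven digits.
IMDB_ID = re.compile(r"tt\d{7,}")


def extract(wiki_text, metadata_text, ratings_text):
    """Parse the raw JSON and CSV text into lists of dictionaries."""
    wiki = json.loads(wiki_text)
    metadata = list(csv.DictReader(io.StringIO(metadata_text)))
    ratings = list(csv.DictReader(io.StringIO(ratings_text)))
    return wiki, metadata, ratings


def transform(wiki, metadata, ratings):
    """Merge the three sources on the IMDB id and average the ratings."""
    totals = {}
    for row in ratings:
        total, count = totals.get(row["imdb_id"], (0.0, 0))
        totals[row["imdb_id"]] = (total + float(row["rating"]), count + 1)

    meta_by_id = {row["imdb_id"]: row for row in metadata}
    movies = []
    for entry in wiki:
        match = IMDB_ID.search(entry.get("imdb_link", ""))
        # Drop Wikipedia entries that cannot be matched to metadata.
        if match is None or match.group() not in meta_by_id:
            continue
        imdb_id = match.group()
        meta = meta_by_id[imdb_id]
        total, count = totals.get(imdb_id, (0.0, 0))
        movies.append({
            "imdb_id": imdb_id,
            "title": entry["title"],
            "year": entry.get("year"),
            "budget": int(meta["budget"]),
            "runtime": int(meta["runtime"]),
            "avg_rating": total / count if count else None,
        })
    return movies


movies = transform(*extract(WIKI_JSON, METADATA_CSV, RATINGS_CSV))
print(movies)
```

In a real run the three strings would be replaced by the contents of the data files; a library such as pandas would likely be a more convenient choice for large inputs.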

Prerequisites

Software: Python, Anaconda Navigator, Conda, Jupyter Notebook, PostgreSQL, pgAdmin 4.


Loading data in the PostgreSQL Movie Database



ETL Image

Summary

The ETL Jupyter notebook collects and cleans movie data from several sources (a Wikipedia JSON file plus Kaggle and ratings CSV files). It transforms and merges the data, then loads it into two updatable PostgreSQL database tables.
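
One way to keep a table "updatable" is to make the load step safe to rerun, so that a second run replaces the rows instead of duplicating them. The sketch below shows that idea under stated assumptions: it uses Python's built-in `sqlite3` module as a stand-in so it runs without a PostgreSQL server, and the `movies` table layout and the `load_movies` helper are hypothetical. With a PostgreSQL driver such as psycopg2, the parameter placeholder would be `%s` rather than `?`, and the connection and cursor handling would differ.

```python
import sqlite3


def load_movies(conn, movies):
    """Replace the contents of the (hypothetical) movies table."""
    with conn:  # commits on success, rolls back if an error is raised
        conn.execute(
            "CREATE TABLE IF NOT EXISTS movies ("
            "imdb_id TEXT PRIMARY KEY, title TEXT, avg_rating REAL)"
        )
        # Clear old rows first so rerunning the pipeline cannot duplicate data.
        conn.execute("DELETE FROM movies")
        conn.executemany(
            "INSERT INTO movies (imdb_id, title, avg_rating) VALUES (?, ?, ?)",
            [(m["imdb_id"], m["title"], m["avg_rating"]) for m in movies],
        )


# In-memory database standing in for the PostgreSQL movie database.
conn = sqlite3.connect(":memory:")
rows = [{"imdb_id": "tt0000001", "title": "Example Movie", "avg_rating": 3.5}]

load_movies(conn, rows)
load_movies(conn, rows)  # a second run leaves a single copy of each row

count = conn.execute("SELECT COUNT(*) FROM movies").fetchone()[0]
print(count)
```

Deleting and reinserting is the simplest rerunnable strategy; for large tables an incremental update may be preferable.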
