Skip to content

This repository contains a Python script for web scraping IMDb website to extract information about the top TV shows of all time. The script retrieves data such as title, year of release, genre, rating, runtime, number of votes, and cast. The extracted data is then saved in a CSV file for further analysis.

jaiswal-ashutosh/web_scraping_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IMDb Web Scraping Project

This is a Python project that scrapes data from the IMDb website for the top-rated TV shows and saves it in a CSV file. It uses the requests and BeautifulSoup libraries to extract information such as the show's title, year of release, rating, genre, runtime, number of votes, and stars.

Requirements

This project requires Python 3.x and the following libraries:

  • requests

  • BeautifulSoup

  • pandas

  • You can install these libraries using pip:

    pip install requests beautifulsoup4 pandas

    Usage

    To use the project, simply run the imdb_web_scraping.py file.
    The script will extract the data from the IMDb website and save it in a CSV file called imdb_top_rated_tv_shows.csv.



    The data includes the following fields:

  • TitleYear
  • Rating
  • Genre
  • Runtime
  • Number of votes
  • Stars


  • Credits

    This project was created by Ashutosh Jaiswal. The project uses data from IMDb

    About

    This repository contains a Python script for web scraping IMDb website to extract information about the top TV shows of all time. The script retrieves data such as title, year of release, genre, rating, runtime, number of votes, and cast. The extracted data is then saved in a CSV file for further analysis.

    Topics

    Resources

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published