
ritika-0111/Movie-Recommendation-on-IMDB-Dataset


Movie-Recommendation-on-IMDB-Dataset

The dataset is the IMDB top 250 English movies. It can be downloaded from: https://data.world/studentoflife/imdb-top-250-lists-and-5000-or-so-data-records.
The dataset contains 250 movies (rows) and 38 attributes (columns).
I used the Rapid Automatic Keyword Extraction (RAKE) library. RAKE is a domain-independent keyword extraction algorithm that tries to determine key phrases in a body of text by analyzing how frequently each word appears and how often it co-occurs with other words in the text.
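To illustrate the idea behind RAKE, here is a minimal plain-Python sketch of its scoring scheme. It is not the RAKE library used in this project: the `STOPWORDS` set, the function names, and the sample sentence are all assumptions made up for illustration. Text is split into candidate phrases at stopwords and punctuation, each word is scored as degree divided by frequency, and each phrase is scored as the sum of its word scores.

```python
import re
from collections import defaultdict

# Tiny illustrative stopword list (an assumption); a real run would use a full list.
STOPWORDS = {"a", "an", "the", "of", "and", "in", "is", "to", "by", "with", "on", "for"}


def candidate_phrases(text):
    """Split text into candidate phrases at punctuation and stopwords."""
    phrases = []
    for fragment in re.split(r"[^\w\s]+", text.lower()):
        current = []
        for word in fragment.split():
            if word in STOPWORDS:
                if current:
                    phrases.append(tuple(current))
                current = []
            else:
                current.append(word)
        if current:
            phrases.append(tuple(current))
    return phrases


def rake_scores(text):
    """Return (phrase, score) pairs sorted from highest to lowest score."""
    phrases = candidate_phrases(text)
    freq = defaultdict(int)    # how often each word appears
    degree = defaultdict(int)  # co-occurrence count, including the word itself
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)
    word_score = {w: degree[w] / freq[w] for w in freq}
    scores = {" ".join(p): sum(word_score[w] for w in p) for p in phrases}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Made-up plot sentence, not taken from the dataset.
sample = "a young wizard fights a dark lord and saves the magic school"
ranked = rake_scores(sample)
print(ranked[0])
```

Longer phrases made of words that rarely appear alone score highest, which is why multi-word phrases tend to rank above single words.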
This project is a content-based recommender using Natural Language Processing (NLP).
Strategy:
Count Vectorizer + Cosine Similarity

  • Count Vectorizer: converts sentences into vectors of word counts.
  • Cosine Similarity: measures the similarity of two vectors as the cosine of the angle between them.
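The strategy above can be sketched in plain Python. This is a stand-in under stated assumptions, not the project's code: it uses `collections.Counter` in place of a library count vectorizer, and the movie names, keyword strings, and the `recommend` helper are hypothetical.

```python
import math
from collections import Counter


def count_vector(text):
    """Turn a keyword string into a sparse vector of word counts."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(title, bags, top_n=2):
    """Return the top_n titles most similar to the chosen title."""
    query = count_vector(bags[title])
    scored = [
        (other, cosine_similarity(query, count_vector(text)))
        for other, text in bags.items()
        if other != title
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in scored[:top_n]]


# Hypothetical keyword bags, not taken from the dataset.
bags = {
    "Movie A": "crime drama mafia family",
    "Movie B": "crime mafia boss family",
    "Movie C": "space adventure robot",
}
print(recommend("Movie A", bags, top_n=1))
```

Because cosine similarity depends only on the angle between vectors, a movie with a long keyword bag is not favored over one with a short bag that shares the same proportion of terms.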

About

Movie Recommendation: suggests the top movies a user may want to watch, based on the movie they currently choose.
