Skip to content

GCP hosted product for over 1 million movie investors on HSX.com, aiding online movie trading and box-office investments by leveraging Big Data technologies like Hive and Hadoop, and Tableau dashboards

Notifications You must be signed in to change notification settings

akshay-madar/MovieTycoon-gcp-based-BI-tool

Repository files navigation

Movie-Tycoon: Big Data and NLP for movies

  • Developed GCP-hosted product for over 1 million movie investors on HSX.com, aiding online trading by designing end-to-end Hive workflow using MapReduce; weekly update of Box Office trends by dynamic web scraping
  • Instituted proposal for producers to leverage proactive script writing by analyzing ~480K movie reviews on Rotten Tomatoes

Problem Statement:

Movie Tycoon is a platform which helps movie investors on https://www.hsx.com/ by providing insights on where and whom to invest the money on. The aim is to provide creative personnel with a tool to analyze reviews and use it as a feedback for future projects. This way, investors can identify the right price for investments in cinema business, and theatre owners can schedule movie shows based on box office predictions.

Process:

  1. Deployed Python Web Scraping tools to build a corpus of data that could be leveraged for ‘NLP Modeling’
  2. Leveraged HIVE platform to query solutions on movie database
  3. Used Naïve Bayes Algorithm to identify the sentiment of the trends




Results:

  1. Understand Box Office Trends - The top box office returns are observed in Action, Musical and Family genres
  2. Leverage NLP to understand critics reviews - The top words in movies having positive reviews are
  • Story
  • Compelling
  • Performance
  • Brilliant Drama
  1. Movie Business Landscape Analysis - The top movies produced are produced in Drama, Thriller and Comedy genre
  2. Entire product uses real time predictions every Monday at 8am using Hive for scheduling automated workflows

Youtube link:

Visit the following link to listen to the product pitch --> https://www.youtube.com/watch?v=wpuIuco7MX0

Tableau Dashboard


About

GCP hosted product for over 1 million movie investors on HSX.com, aiding online movie trading and box-office investments by leveraging Big Data technologies like Hive and Hadoop, and Tableau dashboards

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published