Skip to content

Movie Studio Recommendations is an early project with in-depth data analysis and visualizations, featuring extensive web scraping.

License

Notifications You must be signed in to change notification settings

threnjen/Movie_Studio_Recommendations

 
 

Repository files navigation

Module 1 Final Project - Movie Studio Recommendations

For: Microsoft Inc. January 2020

By: Jenny Wadkins

Introduction

The time is 2020, in a pre-Covid world.

Microsoft sees all the big companies creating original video content, and they want to get in on the fun. They have decided to create a new movie studio, but the problem is they don’t know anything about creating movies. They have hired you to help them better understand the movie industry. Your team is charged with doing data analysis and creating a presentation that explores what type of films are currently doing the best at the box office. You must then translate those findings into actionable insights that the CEO can use when deciding what type of films they should be creating.

Questions

  • Should Microsoft focus on launching a franchise, or focus on single film IPs?
  • What audience should the company aim for (based on MPAA rating)?
  • What genre(s) should the company aim for?
  • What combination of genre, franchise and rating will result in the lowest risk?
  • What, if any, genres/ratings/franchises should be avoided?

Methodology

  • Source data using a combination of API calls and web scraping
  • Clean and organize data
  • Perform exploratory data analysis
  • Provide conclusions based on analysis

Table of Contents

Notebook Preparation

  • Recommended Extensions
  • Importing our Modules
  • Notebook-wide Functions

Data Acquisition and Cleaning

  • Source 1 - The Movie Database
  • Source 2 - Box Office Mojo
  • Source 3 - IMDB
  • Source 4 - The Numbers

Data Joins and Data Summary

  • Data Join Plan
  • Daraframe Joins
  • Data Summary

Exploratory Data Analysis

  • Studying Correlations
  • Franchises
  • Genres
  • User Ratings
  • MPAA Rating
  • Studios
  • Failures
  • Trends Over Time
  • Director
  • Writer
  • Actors

Conclusions

  • Answers to business questions

Recommendations

Future Work

Errors to Consider

  • API tool to obtain IMDB IDs
  • Web scraper to get box office numbers
  • Web scraper to obtain franchise status

Analysis and Conclusion

Should Microsoft focus on launching a franchise, or focus on single film IPs?

Franchises generally have a higher budget, but their failure rate is far less and they are more likely to a) be profitable and b) yield a surprise hit. They do, however, have a higher variance in both budget and net income. This risk is offset by the observation that franchise films rarely fail, whereas non-franchise films make up the majority of failed films in the last 20 years. A franchise is recommended. Figure 1 - Net Income of Franchise vs Non-Franchise

What audience should the company aim for (based on MPAA rating)?

Various analysis shows that a movie's performance range declines in both consistency and average net income as the rating increases from G to R. G movies have both the highest consistent income range, and the lowest rate of overall failure. PG is shortly behind. PG-13 is most likely to result in a breakout hit, but has a much lower consistent income range. Lower ratings are the safest and most consistent. Net by Rating

What genre(s) should the company aim for?

The genres of Adventure, Animation, Fantasy, Sci-Fi and Action are the most strongly correlated with a positive net income. Net by Genre

What combination of genre, franchise and rating will result in the lowest risk?

While franchises involve a higher variance in both budget and net, this risk is offset by their overall higher success rate, with very few franchise films failing to yield a positive net income. This risk is further reduced by sticking to lower MPAA ratings in the G or PG range, and selecting genres that complement these ratings from the animation, sci-fi, fantasy, adventure and action categories. Overall, a G or PG franchise that utilizes Animation and a complementary category such as Adventure should be a very safe endeavor.

What, if any, genres/ratings/franchises should be avoided?

Analysis of the past 20 years first shows that non-franchise films both make less money and are more likely to fail. Most of the film failures in this time period were not franchises. Films are also more likely to fail as their MPAA rating increases. R-movies in particular have a very high rate of failure, with over 30% of the failures of the last 20 years rated R. Finally, certain genres are clearly more likely to fail, with the Drama and Comedy genres representing about 58% and 42% of failures respectively (these numbers add up to 100, but these genres may be shared together on one film, meaning that film is present in both numbers). Things to be avoided include non-franchise films, higher MPAA ratings, and the Comedy, Drama and Thriller categories. These things should certainly be avoided individually, and definitely avoided in combination. Failures by Rating Failures of Non-Franchise vs Franchise Failures by Genre

Recommendations

Exploring this data led to some really strong conclusions for Microsoft. We have evidence that a FRANCHISE is definitely the way to go, mining some existing IP. Family-friendly rated movies in G or PG are the least risky. They are less likely to result in a surprise breakout hit, but they are consistent performers. Finally, Animation, Adventure, and Fantasy are consistently performing genres. When you combine these components together, you can easily identify a Microsoft-owned IP which is family friendly, lends itself to Animation/Fantasy, and has a user base of 126 million, according to Statista. This IP is, of course, Minecraft. Minecraft

Future Work

  • We can look at the writers/directors that have worked on the genres/ratings we are focusing on
  • As far as casting decisions - we can study the importance of big names and cast size for franchise films
  • We can look at specific release dates and if any particular time of year is most successful for our rating and genres
  • And of course, we can consider additional revenues possibilities from merchandising

Presentation

Video - Data Science Module 1 Project

PDF of Presentation

About

Movie Studio Recommendations is an early project with in-depth data analysis and visualizations, featuring extensive web scraping.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%