Project to determine the ratings of a movie using the movie-lens dataset.
Download ml-latest-small.zip dataset @ http://grouplens.org/datasets/movielens/
- Hive
- Hbase
- Spark
- List all the movies and the number of ratings
- List all the users and the number of ratings they have done for a movie
- List all the Movie IDs which have been rated (Movie Id with at least one user rating it)
- List all the Users who have rated the movies (Users who have rated at least one movie)
- List of all the User with the max ,min ,average ratings they have given against any movie
- List all the Movies with the max ,min, average ratings given by any user
- Hive-HBase Integration
- Spark (RDD, Dataframe, DataSet) solving individually for each category