Classification of MovieLens Tabular Data

Project for the course "Data Analytics" of the University of Bologna, A.Y. 2021/2022. In this project, a data pipeline was implemented to predict the average rating of a film from its features, using machine learning techniques.

Developers

Setup

To run the script, Python must be installed, along with the external libraries listed in requirements.txt, which can be installed with the pip (or pip3) package manager:

pip install -r requirements.txt

We recommend installing the packages and running the project inside a virtual environment, for example one managed by conda.

Environment variable

The file .env.example must be renamed to .env, and its single variable TMDB_API_KEY must be set to your TMDB API key. The key is only needed if you want to download the TMDB dataset via API calls.
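
To illustrate how the key could be read at runtime, here is a minimal sketch that parses a .env-style file using only the standard library. The helper name `load_env_file` is hypothetical and is not taken from the project, whose own loading code may work differently (for example through a dedicated library).

```python
from pathlib import Path


def load_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env-style file into a dict.

    Hypothetical helper for illustration only; not part of the project.
    """
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


if __name__ == "__main__":
    api_key = load_env_file().get("TMDB_API_KEY")
    if api_key:
        print("TMDB_API_KEY found: the TMDB download can be enabled.")
    else:
        print("TMDB_API_KEY not set: skip the TMDB download step.")
```

Returning an empty dict when the file is missing mirrors the fact that the key is optional: code that does not download from TMDB can run without a .env file.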

Usage

python main.py -h

usage: main.py model [--random | --best]

Data Analytics project using MovieLens dataset.

positional arguments:
  {mlp,tree_based,svm,naive_bayes}
                        the name of the model

options:
  -h, --help            show this help message and exit
  -r, --random          demo purpose, use only one random configuration for hyperparams
  -b, --best            use the best training configuration
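
The help text above can be reproduced with Python's argparse module. The following is a sketch of one way such a command line could be defined, not the project's actual main.py: the function name `build_parser` is hypothetical, and treating --random and --best as mutually exclusive is an assumption based on the `[--random | --best]` notation in the usage line.

```python
import argparse

# Model names as listed in the help output.
MODELS = ["mlp", "tree_based", "svm", "naive_bayes"]


def build_parser():
    """Build a parser matching the documented interface (illustrative only)."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Data Analytics project using MovieLens dataset.",
    )
    parser.add_argument("model", choices=MODELS, help="the name of the model")
    # Assumption: the two flags cannot be combined, as the usage line suggests.
    group = parser.add_mutually_exclusive_group()
    group.add_argument(
        "-r", "--random", action="store_true",
        help="demo purpose, use only one random configuration for hyperparams",
    )
    group.add_argument(
        "-b", "--best", action="store_true",
        help="use the best training configuration",
    )
    return parser


# Parse an explicit argument list instead of sys.argv to keep the demo deterministic.
args = build_parser().parse_args(["mlp", "--random"])
print(args.model, args.random, args.best)  # prints: mlp True False
```

With this definition, passing both flags at once, or a model name outside the listed choices, makes argparse print an error and exit.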

Notebooks

The notebooks implement the fundamental parts of the project and are provided to make it easier to understand. To avoid errors, we recommend running them in alphabetical order.
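
As a small aid, the sketch below lists notebooks in the order in which they should be run. It assumes the notebooks are .ipynb files stored directly in a folder named notebooks; the function name `notebooks_in_order` is hypothetical.

```python
from pathlib import Path


def notebooks_in_order(folder="notebooks"):
    """Return the .ipynb files in `folder`, sorted alphabetically by file name."""
    return sorted(Path(folder).glob("*.ipynb"), key=lambda p: p.name)


# Print the suggested execution order (prints nothing if the folder is missing).
for path in notebooks_in_order():
    print(path.name)
```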

Report

The report, main.pdf, describes the various parts of the project from both an implementation and a conceptual point of view.
