Skip to content

riccardocadei/Higgs-Boson-Challange-2020-EPFL

Repository files navigation

Higgs Boson Challenge 2020 EPFL

General Information

The repository contains the code for Machine Learning course 2020 (CS-433) project 1 (Higgs Boson challenge: https://www.aicrowd.com/challenges/epfl-machine-learning-higgs/leaderboards) at EPFL. More information about this challenge can be found in the folder documents.

Team

The project is accomplished by team INteam with members:

With a Test Accuracy of 0.841 we got the 7-th place out of 277 teams.

Data

The data train.csv and test.csv should be found in https://github.com/epfml/ML_course/tree/master/projects/project1/data, to run the code please download and place them in the data folder

Environment

The project has been developed and test with python3.6. The required library for running the models and training is numpy. The library for visualization is matplotlib.

Results

Results to predict the test datasets are generated by running: python3 run.py. And the final results are saved in: /data/finalsubmission.csv.


Project structure

Training data

implementations.py: the implementation of 6 methods to train the model.

run.py: the results after using the selected model to predict test data.

Processing data

exploration.py: understanding the features of data with visualization.

process_data.py: preprocessing data for model training and prediction.

Selecting Model

crossvalidalidation.py: using cross-validation to test the accuracy of different models.

select_parameter.py: searching for the appropriate parameters(lambda, degree etc.) for models.

Notebook

main.ipynb: tuning the best parameters for ridge regression and predicting the accuracy of all the methods through cross validation.

plots.ipynb: data analysis and visualizion of the accuracy and error with different choices of parameters.

Report

documents/report.pdf: a 2-pages report of the complete solution.

About

A classification problem on a big physical dataset simulated by the ATLAS experiment from CERN - @ EPFL

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published