Error in matches predictions #71

andony-arrieula · 2024-02-28T16:31:46Z

There are 2 errors in the match prediction section:

the data used to make the prediction is not updated, it's the data of the previous match that is reused
the data is passed as a list of values, but the list is not given in the same order as during the training

The match prediction is not usable then.

andony-arrieula · 2024-03-01T10:56:44Z

It seems it's not the data not in the good order, but the preprocessing done in preprocess_dataset which is not performed on the prediction data.

kochlisGit · 2024-03-01T12:11:47Z

This should happen only during Cross-Validation process and it's perfectly normal. The idea is that we randomly selected a train set and a n evaluation set, to measure the performance of the models.

However, during the training period (ONLY), it should use the first 100 matches as test and the rest of the dataset should be processed in the correct order!
Otherwise, can you print the results of the preprocess_dataset for the evaluation data to verify this?

andony-arrieula · 2024-03-01T12:23:21Z

The problem is not on the evaluation data, but on the prediction data passed to the predict match dialog.

For example, I launch my trained model on Ligue 1 French league, I select Paris SG as home team (the best team in the league) and Clermont as away team (the worst team in the league), and I enter the same odd (3.00) for all possible results, the algorithm gives me probabilities of 0.32 for Paris 0.23 a draw and 0.44 for Clermont, which is totally incoherent, and this is because the data given to the model is not processed before it was passed to the model.

kochlisGit · 2024-03-01T12:25:25Z

So the program grabs the features from the history tables, but does not preprocess them beforing passing them to model for prediction?

andony-arrieula · 2024-03-01T12:29:13Z

Exactly !

But the program also does not update the statistics with the data of the previous match !

kochlisGit · 2024-03-01T12:39:48Z

Thanks. I will take a look into it.

andony-arrieula · 2024-03-01T12:42:43Z

I am also looking on that issue on my own side, I think the best way to proceed is to modify the construct_input() method to process the data before giving it to the model to make the prediction.

kochlisGit · 2024-03-01T12:45:59Z

Yeah, I think that would be the best way. The rows should be processed before returned using the scaler of them model's config (if any)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error in matches predictions #71

Error in matches predictions #71

andony-arrieula commented Feb 28, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

Error in matches predictions #71

Error in matches predictions #71

Comments

andony-arrieula commented Feb 28, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024

andony-arrieula commented Mar 1, 2024

kochlisGit commented Mar 1, 2024