-
By default, the resource is just the number of samples, and that is the quantity on which the halving happens. However, the resource could be another parameter instead, e.g. the number of trees in a random forest. In that case, the halving strategy would be applied to the number of trees.
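To make that concrete, here is a minimal sketch of using a model parameter as the halving resource instead of the default number of samples. The dataset, parameter values, and `max_resources=64` are illustrative choices, not anything prescribed by scikit-learn:

```python
# Sketch: halve on the number of trees rather than on n_samples.
# Note that HalvingRandomSearchCV is still experimental and must be
# enabled explicitly via the import below.
from sklearn.experimental import enable_halving_search_cv  # noqa: F401
from sklearn.model_selection import HalvingRandomSearchCV
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=400, random_state=0)

# The resource parameter (n_estimators) must NOT appear in the
# search space itself.
param_distributions = {
    "max_depth": [2, 4, 8, None],
    "min_samples_split": [2, 5, 10],
}

search = HalvingRandomSearchCV(
    RandomForestClassifier(random_state=0),
    param_distributions,
    resource="n_estimators",  # halve on number of trees, not samples
    max_resources=64,         # surviving candidates get up to 64 trees
    random_state=0,
).fit(X, y)

# n_resources_ lists the budget (here: trees) used at each iteration.
print(search.n_resources_)
print(search.best_params_)
```

Every candidate sees the full dataset here; only the number of trees grows between iterations, which is exactly the swap described above.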
-
I was reading over the HalvingRandomSearchCV docs, but I still couldn't quite understand what you mean by "resources". Here is my current understanding; please feel free to correct me:
Unlike RandomizedSearchCV or GridSearchCV, this method does not use the entire dataset for cross-validation from the start. In the first iterations, only a small fraction of the data is drawn for each candidate model, which is then evaluated with a standard K-fold CV. I am guessing that both the training folds and the validation fold come from this fractional subset?
If so, then technically speaking, won't we always expect to see scores improve across iterations, simply and most directly because later iterations train on more data?
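The allocation described above can be observed directly. This is a small sketch, with an arbitrary synthetic dataset and `factor=2` chosen for illustration, that prints the sample budget used at each halving iteration under the default `resource="n_samples"`:

```python
# Sketch: inspect how many samples each successive-halving iteration
# uses. The per-iteration K-fold CV is run on that subsample only.
from sklearn.experimental import enable_halving_search_cv  # noqa: F401
from sklearn.model_selection import HalvingRandomSearchCV
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=1000, random_state=0)

search = HalvingRandomSearchCV(
    LogisticRegression(max_iter=1000),
    {"C": [0.01, 0.1, 1, 10]},
    factor=2,        # each iteration doubles the sample budget
    random_state=0,
).fit(X, y)

# Samples allocated per iteration; the "iter" column of cv_results_
# shows which iteration each candidate's score belongs to.
print(search.n_resources_)
```

Because early scores are computed on smaller subsamples, comparing raw scores across iterations does conflate candidate quality with training-set size, which is the effect the question points at.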