Investigate the discrepancy in default hyperparams compared to LightGBM #32

ogrisel · 2018-11-01T16:15:20Z

Possible culprits:

shrinkage / learning_rate
min_samples_leaf
min_child_samples

The text was updated successfully, but these errors were encountered:

ogrisel · 2018-11-01T17:25:10Z

As @NicolasHug noted, our min_samples_leaf in pygbm is not correct. I would rather implement what LightGBM does, that is reject splits that would result in one of the child nodes having less than min_samples_leaf.

NicolasHug · 2018-11-01T18:27:06Z

You mean sklearn?

LightGBM is doing something very weird with min_sample_leaf, it looks like it is ignored because of num_leaves (see #30 (comment))

guolinke · 2018-11-02T05:10:13Z

@NicolasHug I think you used the wrong parameter name in that code.

ogrisel · 2018-11-02T10:02:17Z

Indeed. It's actually the pygbm handling of min_samples_leaf that is broken. See: #34.

This was referenced Nov 3, 2018

Fix handling of the min_samples_leaf hyperparameter #35

Merged

mean_samples_leaf does not do what it's suppose to do #34

Closed

Benchmark results with better parameters #30

Closed

NicolasHug mentioned this issue Nov 8, 2018

[MGR] Strict comparison to min_gain_to_split #40

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate the discrepancy in default hyperparams compared to LightGBM #32

Investigate the discrepancy in default hyperparams compared to LightGBM #32

ogrisel commented Nov 1, 2018

ogrisel commented Nov 1, 2018

NicolasHug commented Nov 1, 2018

guolinke commented Nov 2, 2018

ogrisel commented Nov 2, 2018

Investigate the discrepancy in default hyperparams compared to LightGBM #32

Investigate the discrepancy in default hyperparams compared to LightGBM #32

Comments

ogrisel commented Nov 1, 2018

ogrisel commented Nov 1, 2018

NicolasHug commented Nov 1, 2018

guolinke commented Nov 2, 2018

ogrisel commented Nov 2, 2018