Support sklearn.compose.TransformedTargetRegressor #240

aidiss · 2022-04-18T15:09:15Z

Currently TuneSearchCV fails when provided with LGBMRegressor provided wrapped inside TransformedTargerRegressor.

For example, this block would fail

regressor = LGBMRegressor(**config)

regressor = TransformedTargetRegressorTwo(
    regressor=regressor, func=np.log1p, inverse_func=np.expm1
)

Failure happens partly because of failure of tune_search.utils.get_early_stop_type to read the type of the estimator.

Note sure if this is the correct implementation. But it should not break anything.

Yard1 · 2022-04-18T16:09:34Z

Hey, this is a good catch. I think we would want to make this more generic if possible. We would eventually want to support other meta-estimators (including user-defined ones). Perhaps we can allow the user to just directly specify the early stop type. Also, I am not sure if we wouldn't need to change the logic related to early stopping in other places, too.

Can you add a unit test for this? Thanks!

aidiss · 2022-04-18T20:26:57Z

Added a test.
Let me know if its in the right place and done in a right way.

aidiss · 2022-04-20T04:35:38Z

Secondly, I wonder, what could be implementation for handling estimators that are inside pipelines.
I guess we could traverse the estimators and look for the one that is supported? Or vice versa, skip the ones that are meta?
Or, maybe it could be possible to tell specifically what kind of early stopping is used by the estimator?

Yard1 · 2022-04-20T13:13:04Z

tune_sklearn/utils.py

@@ -101,6 +103,14 @@ def get_early_stop_type(estimator, early_stopping):

    if not early_stopping:
        return EarlyStopping.NO_EARLY_STOP
+
+    if check_is_pipeline(estimator):
+        for step_name, step in estimator.steps:


Three things:

we shouldn't be only checking for regressors

this is not foolproof & goes against duck typing principle of sklearn

we can just assume that the last step of the pipeline is an estimator. You can't really make a pipeline with multiple estimators.

Yard1 · 2022-04-22T18:17:11Z

Hey @aidiss I took a look at the logic and I unfortunately don't think this will work. We may detect the early stop type correctly, but we are unable to apply it during actual training if the target transformer is present. It would require a more through refactoring of the code. I'll put this on the backlog, unless you would be willing to work on this. We would need to support both the case with a pipeline and without it.

Support sklearn.compose.TransformedTargetRegressor

c8e509c

Yard1 self-assigned this Apr 18, 2022

Add tests

d5b8ae4

Support sklearn Pipeline

980ac20

Yard1 reviewed Apr 20, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support sklearn.compose.TransformedTargetRegressor #240

Support sklearn.compose.TransformedTargetRegressor #240

aidiss commented Apr 18, 2022

Yard1 commented Apr 18, 2022 •

edited

aidiss commented Apr 18, 2022

aidiss commented Apr 20, 2022

Yard1 Apr 20, 2022 •

edited

Yard1 commented Apr 22, 2022

Support sklearn.compose.TransformedTargetRegressor #240

Are you sure you want to change the base?

Support sklearn.compose.TransformedTargetRegressor #240

Conversation

aidiss commented Apr 18, 2022

Yard1 commented Apr 18, 2022 • edited

aidiss commented Apr 18, 2022

aidiss commented Apr 20, 2022

Yard1 Apr 20, 2022 • edited

Choose a reason for hiding this comment

Yard1 commented Apr 22, 2022

Yard1 commented Apr 18, 2022 •

edited

Yard1 Apr 20, 2022 •

edited