Increase compatibility of EBM with scikit-learn #518
Conversation
Signed-off-by: DerWeh <andreas.weh@web.de>
Raising a ValueError is compatible with scikit-learn and therefore the expected behavior. A TypeError is inappropriate in the Python world, as `type(X)` is correct (an `np.ndarray`); only `X.dtype` is wrong, and dtypes are a NumPy concept, not a native Python concept. Signed-off-by: DerWeh <andreas.weh@web.de>
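A minimal sketch of that convention (illustrative only, not interpret's actual validation code): the Python type of `X` is fine, so only a bad dtype is reported, and as a `ValueError`:

```python
import numpy as np

def validate_numeric_dtype(X):
    # Hypothetical helper: type(X) is np.ndarray, which is correct, so a
    # TypeError would be misleading; the dtype being non-numeric is a value
    # problem, which scikit-learn conventionally reports as a ValueError.
    X = np.asarray(X)
    if not np.issubdtype(X.dtype, np.number):
        raise ValueError(f"X has unsupported dtype {X.dtype}")
    return X
```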
We still fail some scikit-learn tests, where I am not sure whether we intentionally deviate. I didn't remove the old (currently skipped) test. The private versions aren't tested at the moment; they fail with some cryptic errors which I do not understand. Sorry, but I don't really understand the DP versions, so I cannot easily fix that part. I think we need these tests back before continuing to work on #514, as #514 would mean a rather drastic refactor.
On the scikit-learn tests:
We can remove the old test_scikit_learn_compatibility test that wasn't enabled. I can have a look at the DP tests once the rest of this PR is in.
Support of 2d y where `y.shape[-1] == 1` is desired. This reverts commit 230cddd. Signed-off-by: DerWeh <andreas.weh@web.de>
I updated the tags to indicate this, which allowed re-enabling the test. Please take a look and check whether the tags seem suitable now.
The scikit-learn fits floating point
I guess the ambiguity is whether we have a single sample and the dimension represents features, or a single feature and the dimension represents samples. Anyway, the test is skipped.
Done
Then I'll leave this up to you. I think the PR should be good to merge now, or do you want me to check whether we can be more permissive about older scikit-learn versions? As I said, you can validate that the correct tags are set.
Ah, now I remember why I did this: I converted y floats to strings because JSON doesn't differentiate between integers and floats. It just has a single number type. I wanted to be able to serialize to JSON and back to a Python object, and without indicating the original datatype it would be ambiguous whether the JSON value 1 should be restored in Python as an integer or a float. I figured integers would be more common than floats for y, so I converted floats to strings. If we want to support both integers and floats, we can do it by including an "output_type" field for classifiers in the JSON to differentiate between integers and floats here:
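A sketch of that idea, using a hypothetical `output_type` field (the field name and payload shape are illustrative, not interpret's actual serialization format):

```python
import json

def serialize_target(y_value):
    # Record whether the original Python value was an int or a float,
    # since JSON itself only has a single number type.
    output_type = "float" if isinstance(y_value, float) else "int"
    return json.dumps({"value": y_value, "output_type": output_type})

def deserialize_target(payload):
    obj = json.loads(payload)
    caster = float if obj["output_type"] == "float" else int
    return caster(obj["value"])
```

A round trip then restores the original Python type even when the JSON text for `1` and `1.0` would otherwise be ambiguous.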
If the rest of the PR is ready I can just merge it first and check afterwards. It's probably fine to use 0.24, but you never know what old environment someone might want to install interpret in, especially a cloud-based one where the user might not even be able to upgrade. I'll review now.
Just tried to test with an old version, which failed because some tests I skip didn't exist in the old version... If we compare the function names instead of the functions themselves, we can alleviate this problem. But more importantly: we actually don't need to bump the version at all; it suffices to increase the scikit-learn version in the
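Comparing names instead of function objects could look roughly like this (the skip-list entry is hypothetical; the point is that a name comparison still works when the check function doesn't exist in an older scikit-learn release):

```python
from functools import partial

# Hypothetical entry; the PR's real skip list may differ.
SKIPPED_CHECK_NAMES = {"check_sample_weights_invariance"}

def should_skip(check):
    # scikit-learn often passes checks around as functools.partial objects,
    # so unwrap before reading the name. Matching on the name avoids having
    # to import the check function, which may not exist in old versions.
    func = getattr(check, "func", check)
    return func.__name__ in SKIPPED_CHECK_NAMES
```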
Thanks @DerWeh -- It's really nice to finally have scikit-learn verification!
This PR adds the estimator checks for scikit-learn compatibility.
We add the following compatibilities:
a ValueError instead of a TypeError
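The machinery these estimator checks hook into can be exercised like so; shown here with a plain scikit-learn estimator, since the EBM classes are not imported in this sketch:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.utils.estimator_checks import check_estimator

# Runs scikit-learn's battery of API-conformance checks and raises on
# failure; the PR applies the same checks to the EBM estimators.
check_estimator(LogisticRegression())
```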