
WIP: threshold optimizer with relaxed fairness constraint fulfillment #1248

Draft
wants to merge 13 commits into base: main

Conversation

@AndreFCruz (Contributor) commented on Jun 28, 2023

Description

This PR corresponds to Issue #1246 (discussion is ongoing over there).

General summary: implementing ThresholdOptimizer with relaxed fairness constraint fulfillment.

Opened the PR as a draft to ease code discussion on the ongoing implementation.

Tests

  • no new tests required
  • new tests added
  • existing tests adjusted
  • new tests needed (TODO)

Documentation

  • no documentation changes needed
  • user guide added or updated
  • API docs added or updated
  • example notebook added or updated

TODO

  • convert asserts to proper error messages (see the sketch after this list)
  • harmonize API with sklearn classifiers and ThresholdOptimizer constructor
  • compatibility with standard inputs for X, Y, S (pd.DataFrame, pd.Series, np.ndarray, etc.)
  • compatibility with non-callable predictors
    • May get fixed if we can switch to using the InterpolatedThresholder instead of our custom RandomizedClassifier implementations.
  • decide on API for RelaxedThresholdOptimizer (same or different class as ThresholdOptimizer)
  • add tests
  • add code documentation (classes and types are already documented)
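
For the first item, a minimal sketch of the intended assert-to-exception conversion (parameter name and message are illustrative):

# Before: a bare assert raises an uninformative AssertionError
# (and is stripped entirely when Python runs with -O).
assert 0 <= tolerance <= 1

# After: an explicit check with a descriptive error message.
if not 0 <= tolerance <= 1:
    raise ValueError(f"tolerance must be in [0, 1]; got {tolerance}.")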

Consolidating code with existing code-base

The _randomized_classifiers.py module provides two main functionalities:

  1. RandomizedClassifier: Constructing a randomized classifier at a given ROC point. This implies triangulating the target ROC point as a linear combination of realized (deterministic) ROC points in the realized ROC curve.
    • That is, to achieve a point in the interior of the ROC curve, we may need to use up to three realized points (each realized point uses a specific deterministic threshold). The final classifier is a randomized classifier (a classifier with a randomized threshold); see the sketch after this list.
  2. EnsembleGroupwiseClassifiers: Bringing all group-specific classifiers together under a single classifier object.
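
To make the triangulation concrete, here is an illustrative sketch (simplified to the two-point case; names are not the PR's actual RandomizedClassifier API) of randomizing between two deterministic thresholds to hit a target ROC point on the segment between their realized points:

import numpy as np

def randomized_predict(y_scores, thr_a, thr_b, p_a, seed=None):
    # Apply threshold thr_a with probability p_a, otherwise thr_b.
    # The resulting (fpr, tpr) is the convex combination
    # p_a * (fpr_a, tpr_a) + (1 - p_a) * (fpr_b, tpr_b).
    rng = np.random.default_rng(seed)
    use_a = rng.random(len(y_scores)) < p_a
    return np.where(use_a, y_scores >= thr_a, y_scores >= thr_b).astype(int)

# To hit a target TPR on the segment between the two realized points:
# p_a = (target_tpr - tpr_b) / (tpr_a - tpr_b)

Reaching a point in the interior extends the same idea to a convex combination of three realized points.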

Part (or all?) of this functionality seems to be covered by the InterpolatedThresholder class.

Are there any examples of using an InterpolatedThresholder object to triangulate an ROC point?

What other functionality do you reckon could be duplicated?

@romanlutz @MiroDudik

@AndreFCruz marked this pull request as draft on June 28, 2023 15:16
@romanlutz (Member) left a comment:

I do realize this is still WIP but I thought I'd share some early thoughts 🙂

We'll also need to think about compatibility with the existing plotting code:

def plot_threshold_optimizer(threshold_optimizer, ax=None, show_plot=True):

Are there any reasonable plots you generate for the relaxed version?

@@ -18,7 +18,7 @@ class ThresholdOperation:
     """

     def __init__(self, operator, threshold):
-        if operator not in [">", "<"]:
+        if operator not in [">", "<"]:  # NOTE for PR: sklearn uses >= for ROC threshold; see: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.roc_curve.html
Member:

Interesting! The way we use it, it shouldn't really be crucial if it's >= or >. We can probably change it. The thresholds are always chosen between the input scores, e.g., between 0.5 and 0.6 we choose 0.55 so they're meant to be strictly on one side or the other. I would have to make sure that it doesn't change anything if we want to make a change here, though. Does this relate to your PR in some way?

Contributor Author:

I think the thresholds in sklearn are the predicted scores for one or more instances (see code here), so using > instead of >= would mean flipping the predictions for some portion of samples.

If it's possible to change it here that would be great!

If not, is there any fairlearn code to build the (fpr, tpr, threshold) ROC triplets? I could switch to using that.
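
A small sanity check of the >= semantics (toy scores and labels):

import numpy as np
from sklearn.metrics import roc_curve

y_true = np.array([0, 0, 1, 1])
y_score = np.array([0.1, 0.4, 0.4, 0.8])

fpr, tpr, thresholds = roc_curve(y_true, y_score)
for t, f, tp in zip(thresholds, fpr, tpr):
    n_geq = int((y_score >= t).sum())  # reproduces sklearn's (fpr, tpr) point
    n_gt = int((y_score > t).sum())    # flips samples whose score equals t
    print(f"threshold={t}: fpr={f}, tpr={tp}, #pos(>=)={n_geq}, #pos(>)={n_gt}")

At threshold 0.4, >= predicts three positives while > predicts only one: exactly the flipped portion of samples mentioned above.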


Parameters
----------
estimator : object
Member:

Are you assuming that it's pre-fit (trained) or not? estimator kind of implies that it's not, while predictor (in sklearn) means it is pre-fit. We have some logic and an argument in ThresholdOptimizer to check for this:

prefit : bool, default=False

Obviously, when adding this functionality to the TO class you can take advantage of that and make sure you pass only a trained model in. In that case, you may want to call this predictor (?) but perhaps I'm splitting hairs now.
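
For reference, a rough sketch of that prefit pattern (illustrative helper, not the exact ThresholdOptimizer internals):

from sklearn.base import clone
from sklearn.exceptions import NotFittedError
from sklearn.utils.validation import check_is_fitted

def _resolve_estimator(estimator, X, y, prefit=False):
    if prefit:
        # Caller promises the model is already trained; verify and use as-is.
        try:
            check_is_fitted(estimator)
        except NotFittedError:
            raise ValueError("prefit=True requires an already-fitted estimator")
        return estimator
    # Otherwise fit a fresh clone, leaving the user's object untouched.
    return clone(estimator).fit(X, y)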


# Compute group-wise ROC curves
if y_scores is None:
    y_scores = self.estimator(X)
Member:

We're doing this a bit differently in TO and I would argue it's preferable:

scores = _get_soft_predictions(self.estimator_, X, self._predict_method)
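
Roughly, that helper dispatches on predict_method and normalizes the output to a 1-D score array; a hedged sketch (not the exact fairlearn source):

def _get_soft_predictions(estimator, X, predict_method):
    if predict_method == "auto":
        # Prefer probabilistic output when the estimator provides it.
        for method in ("predict_proba", "decision_function", "predict"):
            if hasattr(estimator, method):
                predict_method = method
                break
    output = getattr(estimator, predict_method)(X)
    # predict_proba returns shape (n_samples, n_classes); keep the
    # positive-class column.
    return output[:, 1] if predict_method == "predict_proba" else output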

Contributor Author:

Ah right, that was still using the previous callable API.

Contributor Author:

By the way, we can of course omit y_scores here; it is only included because computing predictions is the slowest part of the whole fit method, and oftentimes users already have the predictions computed.

An example is when you want to map the whole fairness-accuracy Pareto frontier: you'd call RelaxedThresholdOptimizer with a range of tolerance values (e.g., np.arange(0, 1, 1e-2)), and each call would unnecessarily re-compute predictions. This is in fact the only part of the fit method that scales with the size of the dataset; everything else is generally fast even on large datasets.

Member:

I get your point about performance, but I'm not entirely sure why a user would precompute the scores just to pass them in manually rather than having the class compute them. Or do you mean that they need the scores for other purposes anyway, so computing them twice feels like a waste?

Contributor Author:

Hmm, I don't have a strong opinion on how exactly it is implemented, but I think it would be useful to somehow cache these predictions so we don't have to re-compute them for different levels of tolerance. Given that different values of tolerance will be handled by different class objects, I don't see how the cache could live at the class level (it must be cached outside). An example follows:

When mapping the fairness-accuracy Pareto frontier, the user needs to create a different RelaxedThresholdOptimizer for each tolerance value, e.g.:

import numpy as np

# Assumed to exist already: a fitted `unmitigated_predictor`, the data splits
# (X_train, Y_train, A_train_np, X_test, A_test_np), and the scalar
# `unmitigated_equalized_odds_diff`.

# Compute predictions for models with varying levels of tolerance
def compute_test_predictions_with_relaxed_constraints(tolerance: float, y_scores=None) -> np.ndarray:
    # Instantiate
    clf = _RelaxedThresholdOptimizer(
        predictor=lambda *args, **kwargs: unmitigated_predictor.predict(*args, **kwargs),
        predict_method="__call__",
        constraint="equalized_odds",
        tolerance=tolerance,
    )

    # NOTE: in practice you would use exactly one of the two options below.
    # 1st Option: will call `predictor.predict(X_train)`
    clf.fit(X_train, Y_train, sensitive_features=A_train_np)

    # 2nd Option: will *not* call `predictor.predict(X_train)`
    clf.fit(X_train, Y_train, sensitive_features=A_train_np, y_scores=y_scores)

    return clf.predict(X_test, sensitive_features=A_test_np)

# [For 2nd Option] Pre-compute predictions once, outside the helper
Y_train_scores = unmitigated_predictor.predict(X_train)

# Compute predictions at different levels of tolerance
all_model_predictions = {
    f"train tolerance={tol:.2f}": compute_test_predictions_with_relaxed_constraints(tol, y_scores=Y_train_scores)
    for tol in np.arange(0, unmitigated_equalized_odds_diff, 1e-2)
}

Given that the underlying predictor is the same for all of these objects, clf.fit(...) will call predictor.predict(...) repeatedly (and redundantly) for each tolerance value. The 1st Option calls the predictor's predict_method n times (once per call of the helper function), while the 2nd Option only calls it once (outside the function).

PS: ignore the predict_method="__call__" mess for now 😅

@@ -0,0 +1,461 @@
"""Helper functions to construct and use randomized classifiers.

TODO: this module will probably be substituted by the InterpolatedThresholder
Member:

If we need to make adjustments to InterpolatedThresholder, let's do it (unless it's too complicated).

@romanlutz added the enhancement (New feature or request) and API (Anything which touches on the API) labels on Jun 28, 2023
@AndreFCruz (Contributor Author):

Thanks for the feedback!
I addressed most comments, and will be going through the remaining points throughout the week.

@AndreFCruz force-pushed the andrefcruz/relaxed-postprocessing branch 3 times, most recently from 6498c81 to 07ffdb8 on July 4, 2023 17:00
@AndreFCruz force-pushed the andrefcruz/relaxed-postprocessing branch from dbde72a to b6ada3e on July 6, 2023 12:52
@romanlutz (Member):

> Are there any reasonable plots you generate for the relaxed version?

Answering my own question here: I saw the relaxation plot (below on the right side) in the paper.
[image: relaxation plot from the paper, right-hand panel]

Do you have plans to add that plot? I think it's super insightful, especially if you're familiar with TO. We already have plotting code for TO, so perhaps it wouldn't be too hard to extend it. No pressure, though 🙂 We just need an informative message if someone tries to pass the relaxed TO to the plotting function in case we don't add the plotting code just yet.
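
For the informative message, something like this guard would do (a sketch; _RelaxedThresholdOptimizer is the working name used elsewhere in this PR):

def plot_threshold_optimizer(threshold_optimizer, ax=None, show_plot=True):
    # Fail early with an informative message until relaxed plots exist.
    if isinstance(threshold_optimizer, _RelaxedThresholdOptimizer):
        raise NotImplementedError(
            "Plotting is not yet supported for relaxed-constraint "
            "threshold optimizers; pass a ThresholdOptimizer instead."
        )
    ...  # existing plotting logic unchanged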

@AndreFCruz (Contributor Author):

Regarding the plot, I can add that exact plotting function from Figure 9 as implemented here.

For now, I wanted to get the core working and ready to merge; the plotting is somewhat independent of the rest. I'll add the check for when this class is passed to the current plotting function.

@romanlutz (Member):

> Regarding the plot, I can add that exact plotting function from Figure 9 as implemented here.
>
> For now, I wanted to get the core working and ready to merge; the plotting is somewhat independent of the rest. I'll add the check for when this class is passed to the current plotting function.

I completely agree.

Labels: API (Anything which touches on the API), enhancement (New feature or request)

Successfully merging this pull request may close these issues:

ENH relaxed fairness constraint fulfillment via postprocessing (currently only supports strict fulfillment)