
Fair Dummies: Achieving Equalized Odds by Resampling Sensitive Attributes

This package implements "Fair Dummies": a flexible framework [1] for learning predictive models that approximately satisfy the equalized odds notion of fairness. This is achieved by introducing a general discrepancy function that rigorously quantifies violations of this criterion, and by turning it into a differentiable penalty that drives the model parameters towards equalized odds.
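To make this concrete, here is a rough sketch of how such a penalty can enter training. The discrepancy below is a toy second-moment comparison, not the paper's general statistic, and the variable names (including the resampled attribute A_tilde, described next) are placeholders rather than the package's API:

import torch

torch.manual_seed(0)
X = torch.randn(256, 5)                       # features
Y = torch.randn(256)                          # continuous response
A = torch.randint(0, 2, (256,)).float()       # sensitive attribute
A_tilde = A[torch.randperm(256)]              # placeholder for the fair dummies
model = torch.nn.Linear(5, 1)
lam = 1.0                                     # fairness/accuracy trade-off

def toy_discrepancy(pred, a, a_dummy):
    # Compare how predictions covary with the true attribute versus the
    # resampled one; the penalty vanishes when the two are interchangeable.
    cov_real = (pred * a).mean() - pred.mean() * a.mean()
    cov_dummy = (pred * a_dummy).mean() - pred.mean() * a_dummy.mean()
    return (cov_real - cov_dummy) ** 2

pred = model(X).squeeze(-1)
loss = torch.nn.functional.mse_loss(pred, Y) + lam * toy_discrepancy(pred, A, A_tilde)
loss.backward()                               # the penalty is differentiable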

To rigorously evaluate fitted models, we also implement a formal hypothesis test to detect when a prediction rule violates the equalized odds property. Both the model fitting and hypothesis testing leverage a resampled version of the sensitive attribute obeying the equalized odds property by construction.
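For intuition, the resampling step can be sketched in a few lines, assuming a binary sensitive attribute and a discrete label; the function name here is illustrative, not part of the package:

import numpy as np

def sample_fair_dummies(A, Y, rng):
    # Draw each dummy from an empirical estimate of P(A = 1 | Y = y); the
    # result matches the joint law of (A, Y) but, given Y, is independent of
    # everything else, so it satisfies equalized odds by construction.
    A_tilde = np.empty_like(A)
    for y in np.unique(Y):
        idx = (Y == y)
        p = A[idx].mean()                     # estimate of P(A = 1 | Y = y)
        A_tilde[idx] = rng.binomial(1, p, size=idx.sum())
    return A_tilde

rng = np.random.default_rng(0)
Y = rng.integers(0, 2, size=1000)                # labels
A = rng.binomial(1, np.where(Y == 1, 0.7, 0.3))  # attribute correlated with Y
A_tilde = sample_fair_dummies(A, Y, rng)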

Lastly, we demonstrate how to incorporate techniques for equitable uncertainty quantification (unbiased for each protected group) to precisely communicate the results of the data analysis.
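For instance, equalized coverage in the sense of [5] can be approximated by calibrating split-conformal prediction intervals separately within each protected group. Below is a minimal sketch under that interpretation; the names are illustrative and not the package's API:

import numpy as np

def group_conditional_intervals(predict, X_cal, y_cal, g_cal, X_test, g_test, alpha=0.1):
    # Split-conformal intervals calibrated separately per group, so each
    # protected group receives (approximately) the same 1 - alpha coverage.
    lo, hi = np.empty(len(X_test)), np.empty(len(X_test))
    for g in np.unique(g_cal):
        res = np.abs(y_cal[g_cal == g] - predict(X_cal[g_cal == g]))
        n = res.size
        level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)  # finite-sample correction
        q = np.quantile(res, level)
        mask = (g_test == g)
        pred = predict(X_test[mask])
        lo[mask], hi[mask] = pred - q, pred + q
    return lo, hi

rng = np.random.default_rng(0)
X_cal, X_test = rng.normal(size=(500, 3)), rng.normal(size=(200, 3))
g_cal, g_test = rng.integers(0, 2, size=500), rng.integers(0, 2, size=200)
predict = lambda X: X @ np.ones(3)            # stand-in for a fitted model
y_cal = predict(X_cal) + rng.normal(size=500)
lo, hi = group_conditional_intervals(predict, X_cal, y_cal, g_cal, X_test, g_test)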

[1] Y. Romano, S. Bates, and E. J. Candès, “Achieving Equalized Odds by Resampling Sensitive Attributes.” Advances in Neural Information Processing Systems (NeurIPS), 2020.

Getting Started

The implementation of [1] is self-contained and written in Python.

Usage

Please refer to synthetic_experiment.ipynb for basic usage. The notebooks real_classification_experiment.ipynb and real_regression_experiment.ipynb demonstrate how to use the software package on real data.

Comparisons to competitive methods and additional usage examples of this package can be found in all_classification_experiments.py and all_regression_experiments.py.

Further information and dependencies

This package also implements:

[2] B. H. Zhang, B. Lemoine, and M. Mitchell, "Mitigating unwanted biases with adversarial learning." In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, pp. 335-340, 2018.

[3] J. Mary, C. Calauzènes, and N. El Karoui, "Fairness-aware learning for continuous attributes and treatments." International Conference on Machine Learning (ICML), 2019.

Dependencies:

[4] Y. Romano, E. Patterson, and E. J. Candès, “Conformalized quantile regression.” Advances in Neural Information Processing Systems (NeurIPS), 2019.

[5] Y. Romano, R. F. Barber, C. Sabatti, and E. J. Candès, “With malice toward none: Assessing uncertainty via equalized coverage.” Harvard Data Science Review (HDSR), 2019.

Prerequisites

  • python
  • numpy
  • scipy
  • scikit-learn
  • scikit-garden
  • pytorch
  • pandas

Installing

The development version is available on GitHub:

git clone https://github.com/yromano/fair_dummies.git
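The Python dependencies listed under Prerequisites can typically be installed from PyPI; the distribution names below are our assumption of the standard ones, and note that scikit-garden is no longer actively maintained, so it may require an older scikit-learn:

pip install numpy scipy scikit-learn scikit-garden pandas torch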

Reproducible Research

The code available under synthetic_experiment.ipynb, all_classification_experiments.py, and all_regression_experiments.py in the repository replicates all experimental results in [1].

Publicly Available Datasets

Data subject to copyright/usage rules

The Medical Expenditure Panel Survey (MEPS) data can be downloaded by following the instructions provided by IBM's AIF360.

  • MEPS_21: Medical expenditure panel survey, panel 21.

License

This project is licensed under the MIT License - see the LICENSE file for details.