Ensuring Fairness Beyond the Training Data

Preliminary Tests

main/Part1_CheckFairness just contains some preliminary tests we ran on COMPAS and Adult to see how existing fair classifiers performed against a standard LogisticRegression baseline. These can be safely ignored.
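
Purely as an illustration of that kind of check (not code from this repository), here is a minimal sketch that fits a standard LogisticRegression on a numerically encoded dataset and reports a demographic parity gap; the file name and the column names (label, sex) are hypothetical:

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical, numerically encoded csv; the real data loading lives in this repo's scripts.
df = pd.read_csv("adult_preprocessed.csv")
X, y = df.drop(columns=["label"]), df["label"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
pred = clf.predict(X_test)

# Demographic parity gap: difference in positive prediction rates between the two groups.
group = X_test["sex"].to_numpy()
dp_gap = abs(pred[group == 1].mean() - pred[group == 0].mean())
print(f"accuracy={clf.score(X_test, y_test):.3f}, DP gap={dp_gap:.3f}")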

Examining Marginal Distributions and Perturbations

These experiments can be found in main/Part1_Marginal. In these files, we used existing fair classifiers to check how non-robust they are to weighted perturbations of the data. To visualize this, we plot the marginal distribution of each feature in the data, for both Adult and COMPAS. We find that the marginals are almost identical, yet fairness is quickly violated. These experiments correspond to Figure 1 in the paper.
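
As a rough sketch of that visualization (the repository's own plotting lives in main/Part1_Marginal; the file name, feature name, and the random reweighting below are illustrative assumptions, not the paper's perturbation):

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("compas_preprocessed.csv")  # hypothetical preprocessed file
feature = df["age"]                          # hypothetical feature column

# Illustrative perturbation: a small random reweighting of the examples.
rng = np.random.default_rng(0)
weights = np.clip(1.0 + 0.1 * rng.standard_normal(len(df)), 0.0, None)

# Marginal distribution of the feature under the original vs. perturbed weighting.
bins = np.histogram_bin_edges(feature, bins=30)
plt.hist(feature, bins=bins, density=True, alpha=0.5, label="original")
plt.hist(feature, bins=bins, weights=weights, density=True, alpha=0.5, label="perturbed")
plt.xlabel("age")
plt.ylabel("density")
plt.legend()
plt.show()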

Training Robust Classifier

Our robust classifier algorithms (Algorithms 1, 2, and 3) can be found in the main/Part2_MetaAlgo_FTRL directory. The main files here are:

  1. main.py: The file that runs everything and trains your robust classifier. It outputs a single .pkl file containing the robust classifier as a VotingClassifier object (see the loading sketch after the helper-file list below). Hyperparameters can be tweaked in this file and through the command-line arguments it defines. For instance, to set T_inner = 50 and T = 200 with epsilon = 0.05, run: python main.py --T_inner 50 --T 200 --epsilon 0.05. The available flags, as well as the paths to the csv data files, are all specified in this file.
  2. meta_algo.py: Algorithm 1 in the paper/the outer loop. Everything else gets run from here.
  3. bayesian_oracle.py: Algorithm 3 in the paper/the inner loop, AKA the ApxFair algorithm.
  4. lambda_best_response_param_parallel.py: Algorithm 2 in the paper/the LPs. This is the best response from the Lambda player in the ApxFair algorithm.

Helper files (can be safely ignored):

  1. evaluate_fairness.ipynb: just helps me make some pretty graphs.
  2. voting_classifier.py: the helper file that contains the VotingClassifier class I use to wrap multiple hypotheses into a majority-vote classifier. This is used by the files above, but there is no need to look into it.
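
As a rough sketch of how the saved model might be used (the output path below is hypothetical and is really set in main.py, and voting_classifier.py must be importable so that pickle can reconstruct the VotingClassifier):

import pickle
import numpy as np

# Hypothetical path; main.py determines where the .pkl file is actually written.
with open("robust_classifier.pkl", "rb") as f:
    robust_clf = pickle.load(f)  # the saved VotingClassifier

# Hypothetical test matrix; use the same feature encoding as in training.
X_test = np.load("X_test.npy")
predictions = robust_clf.predict(X_test)
print(predictions[:10])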

For the exact experiments that we ran in the paper (with tuned hyperparameters), run final_DP_1000.sh and final_EO_500.sh to train robust classifiers on all the train/test splits and all datasets.

Evaluation

The evaluation and comparison of the robustness of our classifier can be done through main/Part3_Comparisons. There is a notebook for each dataset and fairness definition; the saved plots are under neurips-plots, and all of our trained models are in trained_robust. The ensemble_final folder in trained_robust contains all the models we used for our experiments, and the h0_final folder contains the models obtained by running only ApxFair (i.e., one iteration of the inner loop).
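
As a minimal sketch of the kind of comparison those notebooks make, here is a weighted demographic parity gap, the quantity one would compute for a model's predictions under a perturbed (reweighted) test distribution; the arrays at the bottom are dummy stand-ins, not real predictions or weights:

import numpy as np

def weighted_dp_gap(predictions, group, weights):
    # Difference in weighted positive prediction rates between the two groups.
    rate_1 = (weights * predictions)[group == 1].sum() / weights[group == 1].sum()
    rate_0 = (weights * predictions)[group == 0].sum() / weights[group == 0].sum()
    return abs(rate_1 - rate_0)

# Dummy stand-ins; in the notebooks these come from the loaded models and the
# perturbed test distributions.
rng = np.random.default_rng(0)
preds = rng.integers(0, 2, size=1000)
group = rng.integers(0, 2, size=1000)
weights = rng.random(1000)
print(f"weighted DP gap: {weighted_dp_gap(preds, group, weights):.3f}")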

Because of the file size limits for submission to NeurIPS, we only include the trained models for the first train/test split. However, running final_DP_1000.sh and final_EO_500.sh will reproduce the exact conditions used to train the models for all five train/test splits for each dataset and fairness definition.

How To Run

Get off the ground by setting up the basic environment using the included environment.yml file:

conda env create -f environment.yml

This should be sufficient to start running main.py. An example run:

python main.py --T_inner 50 --T 200 --epsilon 0.05

There are many more tweakable hyperparameters that can be changed through the command line; just check the main.py file to see all of the flags. Every hyperparameter corresponds to the symbol used in the paper.
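
Assuming main.py parses its flags with Python's standard argparse (an assumption; check the file itself), the full list of flags can also be printed from the command line:

python main.py --help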
