This project implements adversarial debiasing to improve model fairness within the DiCE (Diverse Counterfactual Explanations) framework.
Adversarial debiasing trains a predictor to be accurate while an adversary tries to recover protected attributes (e.g., gender, race) from the predictor's outputs; penalizing the adversary's success pushes the predictor toward fairer behavior (a minimal training sketch follows the list below). The implementation includes:
- A standard model (baseline)
- An adversarially debiased model
- Evaluation of fairness metrics
- Analysis of model performance
- Assessment of recourse feasibility
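The following is a minimal sketch of one adversarial debiasing training step in TensorFlow 2.x. The network sizes, the adversary input (the predictor's logit), and the trade-off weight `adv_weight` are illustrative assumptions, not the exact configuration used in `adversarial_debiasing.py`.

```python
# Sketch of adversarial debiasing: a predictor learns the label while an
# adversary tries to recover the protected attribute from the predictor's output.
import tensorflow as tf

predictor = tf.keras.Sequential([
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(1)  # income logit
])
adversary = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(1)  # protected-attribute logit (e.g., gender)
])

bce = tf.keras.losses.BinaryCrossentropy(from_logits=True)
pred_opt = tf.keras.optimizers.Adam(1e-3)
adv_opt = tf.keras.optimizers.Adam(1e-3)
adv_weight = 0.5  # assumed trade-off between accuracy and fairness


@tf.function
def train_step(x, y, z):  # y = label, z = protected attribute (both 0/1)
    # 1) Update the adversary: learn to recover z from the predictor's output.
    with tf.GradientTape() as tape:
        y_logit = predictor(x, training=True)
        z_logit = adversary(y_logit, training=True)
        adv_loss = bce(z, z_logit)
    adv_grads = tape.gradient(adv_loss, adversary.trainable_variables)
    adv_opt.apply_gradients(zip(adv_grads, adversary.trainable_variables))

    # 2) Update the predictor: stay accurate while making the adversary fail.
    with tf.GradientTape() as tape:
        y_logit = predictor(x, training=True)
        z_logit = adversary(y_logit, training=False)
        pred_loss = bce(y, y_logit) - adv_weight * bce(z, z_logit)
    pred_grads = tape.gradient(pred_loss, predictor.trainable_variables)
    pred_opt.apply_gradients(zip(pred_grads, predictor.trainable_variables))
    return pred_loss, adv_loss
```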
- `adversarial_debiasing.py`: Complete implementation of adversarial debiasing with all analyses
- `train_adversarial_debiasing.py`: Script to train a model with adversarial debiasing
- `evaluate_fairness.py`: Script to evaluate fairness metrics
- `compare_performance.py`: Script to compare model performance
- `analyze_recourse.py`: Script to analyze recourse feasibility
- `run_all.py`: Script to run all analyses in sequence
- Python 3.7+
- TensorFlow 2.x
- NumPy
- Pandas
- Matplotlib
- Seaborn
- Scikit-learn
- DiCE
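The dependencies can typically be installed from PyPI (the DiCE library is distributed as `dice-ml`):

pip install tensorflow numpy pandas matplotlib seaborn scikit-learn dice-ml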
You can run the complete analysis with:
python adversarial_debiasing.py
Or run individual scripts:
python train_adversarial_debiasing.py
python evaluate_fairness.py
python compare_performance.py
python analyze_recourse.py
Or run all scripts in sequence:
python run_all.py
The fairness evaluation reports the following metrics (a sketch of how they can be computed follows this list):
- Demographic Parity Difference
- Equal Opportunity Difference
- Disparate Impact Ratio
- Equalized Odds Difference
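Below is an illustrative way to compute these group fairness metrics from binary predictions and a binary protected attribute; the helper is a sketch, not the exact code in `evaluate_fairness.py`.

```python
# Illustrative computation of the four fairness metrics for a binary classifier
# with a binary protected attribute (0/1 predictions assumed).
import numpy as np

def fairness_metrics(y_true, y_pred, protected):
    y_true, y_pred, protected = map(np.asarray, (y_true, y_pred, protected))
    g0, g1 = protected == 0, protected == 1          # e.g., female / male

    # Positive prediction rate per group
    p0, p1 = y_pred[g0].mean(), y_pred[g1].mean()

    # True/false positive rate per group
    tpr = lambda g: y_pred[g & (y_true == 1)].mean()
    fpr = lambda g: y_pred[g & (y_true == 0)].mean()

    return {
        "demographic_parity_difference": abs(p1 - p0),
        # One common convention: ratio of the smaller to the larger group rate
        "disparate_impact_ratio": min(p0, p1) / max(p0, p1),
        "equal_opportunity_difference": abs(tpr(g1) - tpr(g0)),
        "equalized_odds_difference": max(abs(tpr(g1) - tpr(g0)),
                                         abs(fpr(g1) - fpr(g0))),
    }
```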
Model performance is compared using (see the sketch after this list):
- Accuracy
- Precision
- Recall
- F1 Score
- ROC AUC
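These are standard scikit-learn metrics; one way they might be collected for each model (baseline and debiased):

```python
# Standard scikit-learn metrics gathered into one dictionary per model.
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score)

def performance_metrics(y_true, y_pred, y_score):
    # y_pred: hard 0/1 predictions; y_score: predicted probability of class 1
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "precision": precision_score(y_true, y_pred),
        "recall": recall_score(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
        "roc_auc": roc_auc_score(y_true, y_score),
    }
```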
Recourse feasibility is assessed with (a DiCE-based sketch follows this list):
- Success Rate
- Average Number of Changes
- Average Distance
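The sketch below shows one way these recourse metrics can be derived from DiCE counterfactuals. The feature names, the scikit-learn-style model, and the distance measure (a simple normalized count of changed features) are assumptions for illustration and not necessarily what `analyze_recourse.py` does.

```python
# Illustrative recourse analysis with DiCE (dice_ml); assumes `model` is a
# trained sklearn-style classifier and `df` holds the features plus "income".
import dice_ml
import numpy as np

d = dice_ml.Data(dataframe=df, continuous_features=["age", "hours_per_week"],
                 outcome_name="income")
m = dice_ml.Model(model=model, backend="sklearn")
exp = dice_ml.Dice(d, m, method="random")

queries = df.drop(columns="income").sample(50, random_state=0)
result = exp.generate_counterfactuals(queries, total_CFs=3,
                                      desired_class="opposite")

n_changes, distances, successes = [], [], 0
for query, cf_example in zip(queries.itertuples(index=False),
                             result.cf_examples_list):
    cfs = cf_example.final_cfs_df
    if cfs is None or len(cfs) == 0:
        continue  # no counterfactual found for this instance
    successes += 1
    original = np.array(query, dtype=object)
    cf_features = cfs.drop(columns="income").to_numpy()
    changed = (cf_features != original).sum(axis=1)       # features altered per CF
    n_changes.append(changed.mean())
    distances.append(changed.mean() / original.shape[0])  # normalized change count

print("Success rate:", successes / len(queries))
print("Average number of changes:", np.mean(n_changes))
print("Average distance:", np.mean(distances))
```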
The analysis generates several plots in the `plots` directory:
- `fairness_metrics_comparison.png`: Comparison of fairness metrics
- `performance_metrics_comparison.png`: Comparison of model performance
- `accuracy_by_gender.png`: Analysis of accuracy by gender
- `recourse_metrics_comparison.png`: Comparison of recourse feasibility metrics
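A minimal sketch of how such a comparison plot might be produced with Matplotlib; the helper name and output path are illustrative, not the project's exact plotting code.

```python
# Grouped bar chart comparing the baseline and debiased models on a set of
# metrics, saved under the plots/ directory.
import os
import numpy as np
import matplotlib.pyplot as plt

def plot_metric_comparison(baseline, debiased, filename):
    # baseline / debiased: dicts mapping metric name -> value (same keys)
    labels = list(baseline.keys())
    x = np.arange(len(labels))
    width = 0.35

    fig, ax = plt.subplots(figsize=(8, 4))
    ax.bar(x - width / 2, [baseline[k] for k in labels], width, label="Baseline")
    ax.bar(x + width / 2, [debiased[k] for k in labels], width, label="Debiased")
    ax.set_xticks(x)
    ax.set_xticklabels(labels, rotation=30, ha="right")
    ax.legend()
    fig.tight_layout()

    os.makedirs("plots", exist_ok=True)
    fig.savefig(os.path.join("plots", filename))
    plt.close(fig)
```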
The implementation uses the UCI Adult Income dataset, which predicts whether income exceeds $50K/yr based on census data, with gender as the protected attribute.
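DiCE bundles a helper for loading a preprocessed version of this dataset; a minimal sketch (assuming the standard `dice_ml` helpers module):

```python
# Load the preprocessed UCI Adult Income data that ships with DiCE.
from dice_ml.utils import helpers

dataset = helpers.load_adult_income_dataset()
print(dataset.columns.tolist())   # includes "gender" (protected) and "income" (target)
target = dataset["income"]        # 1 if income > $50K/yr, else 0
```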