Counterfactual Fairness by Combining Factual and Counterfactual Predictions
Zhou, Zeyu, Liu, Tianci, Bai, Ruqi, Gao, Jing, Kocaoglu, Murat, Inouye, David I.
–arXiv.org Artificial Intelligence
In high-stake domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness concerns. This work focuses on Counterfactual Fairness (CF), which posits that an ML model's outcome on any individual should remain unchanged if they had belonged to a different demographic group. Previous works have proposed methods that guarantee CF. Notwithstanding, their effects on the model's predictive performance remains largely unclear. To fill in this gap, we provide a theoretical study on the inherent trade-off between CF and predictive performance in a model-agnostic manner. We first propose a simple but effective method to cast an optimal but potentially unfair predictor into a fair one without losing the optimality. By analyzing its excess risk in order to achieve CF, we quantify this inherent trade-off. Further analysis on our method's performance with access to only incomplete causal knowledge is also conducted. Built upon it, we propose a performant algorithm that can be applied in such scenarios. Experiments on both synthetic and semi-synthetic datasets demonstrate the validity of our analysis and methods.
arXiv.org Artificial Intelligence
Sep-3-2024
- Country:
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Catalonia
- North America > United States
- Hawaii > Honolulu County > Honolulu (0.04)
- Europe
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Health & Medicine (0.48)
- Technology: