Aleatoric and Epistemic Discrimination: Fundamental Limits of Fairness Interventions

Jan-18-2025, 09:25:14 GMT–Neural Information Processing Systems

Machine learning (ML) models can underperform on certain population groups due to choices made during model development and bias inherent in the data. We categorize sources of discrimination in the ML pipeline into two classes: aleatoric discrimination, which is inherent in the data distribution, and epistemic discrimination, which is due to decisions made during model development. We quantify aleatoric discrimination by determining the performance limits of a model under fairness constraints, assuming perfect knowledge of the data distribution. We demonstrate how to characterize aleatoric discrimination by applying Blackwell's results on comparing statistical experiments. We then quantify epistemic discrimination as the gap between a model's accuracy when fairness constraints are applied and the limit posed by aleatoric discrimination.
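The gap described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the paper's method: the toy distribution is hypothetical, statistical parity stands in for the unspecified fairness constraint, and a brute-force grid search over randomized classifiers replaces the Blackwell-based characterization. The helper names (`fairness_frontier`, `epistemic_gap`) are likewise hypothetical.

```python
from itertools import product

# Hypothetical toy distribution (not from the paper): binary feature x, binary group s.
P_S = {0: 0.5, 1: 0.5}            # P(S = s)
P_X1_GIVEN_S = {0: 0.5, 1: 0.2}   # P(X = 1 | S = s)
P_Y1_GIVEN_X = {0: 0.2, 1: 0.8}   # P(Y = 1 | X = x), identical in both groups


def p_x_given_s(x, s):
    """P(X = x | S = s) for the toy distribution."""
    return P_X1_GIVEN_S[s] if x == 1 else 1.0 - P_X1_GIVEN_S[s]


def accuracy(h):
    """Expected accuracy of a randomized classifier h[(x, s)] = P(Yhat = 1 | x, s)."""
    total = 0.0
    for (x, s), prob_pos in h.items():
        p_xs = P_S[s] * p_x_given_s(x, s)
        p_y1 = P_Y1_GIVEN_X[x]
        total += p_xs * (prob_pos * p_y1 + (1.0 - prob_pos) * (1.0 - p_y1))
    return total


def parity_gap(h):
    """Absolute difference in positive-prediction rates between the two groups."""
    rates = {s: sum(p_x_given_s(x, s) * h[(x, s)] for x in (0, 1)) for s in (0, 1)}
    return abs(rates[0] - rates[1])


def fairness_frontier(alpha, steps=10):
    """Best accuracy over a grid of classifiers whose parity gap is at most alpha.

    The distribution is assumed fully known, so this approximates the
    aleatoric limit for the toy problem (up to grid resolution).
    """
    grid = [i / steps for i in range(steps + 1)]
    keys = [(x, s) for x in (0, 1) for s in (0, 1)]
    best = 0.0
    for values in product(grid, repeat=len(keys)):
        h = dict(zip(keys, values))
        if parity_gap(h) <= alpha + 1e-9:
            best = max(best, accuracy(h))
    return best


def epistemic_gap(model_accuracy, alpha):
    """Shortfall of a trained model's accuracy relative to the frontier at alpha."""
    return fairness_frontier(alpha) - model_accuracy
```

For example, `epistemic_gap(0.68, 0.0)` compares a hypothetical trained model with 68% accuracy under exact parity against the best accuracy attainable on this toy distribution under the same constraint. A gap near zero would indicate that the remaining accuracy loss is aleatoric rather than a consequence of modeling choices.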

aleatoric and epistemic discrimination, aleatoric discrimination, discrimination

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)