Eliminating Latent Discrimination: Train Then Mask
Ghili, Soheil, Kazemi, Ehsan, Karbasi, Amin
Nowadays, many sensitive decision-making tasks rely on automated statistical and machine learning algorithms. Examples include targeted advertising, credit scores and loans, college admissions, prediction of domestic violence, and even investment strategies for venture capital groups. There has been a growing concern about errors, unfairness, and transparency of such mechanisms from governments, civil organizations and research societies [2, 33, 40]. That is, whether or not we can prevent discrimination against protected groups and attributes (e.g., race, gender, etc). Clearly, training a machine learning algorithm with the standard aim of loss function minimization (i.e., high accuracy, low prediction error, etc) may result in predictive behaviors that are unfair towards certain groups or individuals [18, 29, 42]. In many real-world applications, we are not allowed to use some sensitive features. For example, EU anti-discrimination law prohibits the use of protected attributes (directly or indirectly) for several decision-making tasks [13]. A naive approach towards fairness is to discard sensitive attributes from training data. However, if other (seemingly) nonsensitive variables are correlated with the protected ones, the learning algorithm may use them to proxy for protected features in order to achieve a lower loss.
Nov-12-2018
- Genre:
- Research Report
- New Finding (0.68)
- Experimental Study (0.46)
- Research Report
- Industry:
- Education > Educational Setting (0.48)
- Banking & Finance > Credit (0.34)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.34)
- Law
- Civil Rights & Constitutional Law (0.34)
- Criminal Law (0.34)
- Health & Medicine > Therapeutic Area
- Psychiatry/Psychology (0.34)
- Technology: