Fairness and Robustness of Contrasting Explanations

André Artelt, Barbara Hammer

arXiv.org Artificial Intelligence 

Fairness and explainability are two important and closely related requirements of decision-making systems. While both have been studied extensively on their own, little effort has gone into studying the fairness of explanations themselves. Current explanations can be unfair to an individual: for example, counterfactual explanations may propose different actions for changing the output class to two similar individuals. In this work we formally and empirically study individual fairness, and its mathematical formalization as robustness, for counterfactual explanations as a prominent instance of contrasting explanations. In addition, we propose using plausible counterfactuals instead of closest counterfactuals to improve the individual fairness of counterfactual explanations.
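The unfairness described in the abstract, where two similar individuals receive very different counterfactuals, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy classifier `predict`, the helper `closest_counterfactual`, and the two example individuals are hypothetical, and the Euclidean distance is just one possible choice of metric.

```python
import math

def predict(x):
    """Hypothetical toy classifier: class 1 if either feature reaches 1.0."""
    return 1 if (x[0] >= 1.0 or x[1] >= 1.0) else 0

def closest_counterfactual(x):
    """Closest point (Euclidean) classified as 1, for an input classified as 0.

    For this toy decision region, the nearest class-1 point is obtained by
    raising the feature with the smaller gap to the threshold 1.0.
    """
    gaps = [1.0 - x[0], 1.0 - x[1]]
    i = 0 if gaps[0] <= gaps[1] else 1
    cf = list(x)
    cf[i] = 1.0
    return tuple(cf)

# Two nearly identical individuals, both classified as 0.
x_a = (0.51, 0.50)
x_b = (0.50, 0.51)

cf_a = closest_counterfactual(x_a)  # recommends changing feature 0
cf_b = closest_counterfactual(x_b)  # recommends changing feature 1

input_gap = math.dist(x_a, x_b)   # how far apart the individuals are
cf_gap = math.dist(cf_a, cf_b)    # how far apart their explanations are

print("counterfactual for a:", cf_a)
print("counterfactual for b:", cf_b)
print("input distance:", input_gap)
print("counterfactual distance:", cf_gap)
```

In this sketch the two inputs differ only slightly, yet the closest counterfactuals recommend changing different features, so the distance between the explanations is much larger than the distance between the individuals. Robustness in the sense of the abstract would require that gap to stay small.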
