Weight of Evidence as a Basis for Human-Oriented Explanations
Alvarez-Melis, David, Daumé, Hal III, Vaughan, Jennifer Wortman, Wallach, Hanna
–arXiv.org Artificial Intelligence
Interpretability is an elusive but highly sought-after characteristic of modern machine learning methods. Recent work has focused on interpretability via $\textit{explanations}$, which justify individual model predictions. In this work, we take a step towards reconciling machine explanations with those that humans produce and prefer by taking inspiration from the study of explanation in philosophy, cognitive science, and the social sciences. We identify key aspects in which these human explanations differ from current machine explanations, distill them into a list of desiderata, and formalize them into a framework via the notion of $\textit{weight of evidence}$ from information theory. Finally, we instantiate this framework in two simple applications and show it produces intuitive and comprehensible explanations.
arXiv.org Artificial Intelligence
Oct-29-2019
- Country:
- North America
- Canada (0.04)
- United States
- Wisconsin (0.04)
- Maryland (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Europe > United Kingdom
- England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- England
- North America
- Genre:
- Research Report (0.40)
- Industry:
- Health & Medicine > Therapeutic Area (1.00)