Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Jacovi, Alon (Bar Ilan University and Google Research) | Bastings, Jasmijn (Google Research) | Gehrmann, Sebastian (Google Research) | Goldberg, Yoav (Bar Ilan University and the Allen Institute for Artificial Intelligence) | Filippova, Katja (Google Research)
Journal of Artificial Intelligence Research
We investigate a formalism for the conditions of a successful explanation of AI. We consider "success" to depend not only on what information the explanation contains, but also on what information the human explainee understands from it. Theory of mind literature discusses the folk concepts that humans use to understand and generalize behavior. We posit that folk concepts of behavior provide us with a "language" with which humans understand behavior. We use these folk concepts as a framework of social attribution by the human explainee (the information constructs that humans are likely to comprehend from explanations) by introducing a blueprint for an explanatory narrative (Figure 1) that explains AI behavior with these constructs. We then demonstrate, in a qualitative evaluation, that many XAI methods today can be mapped to folk concepts of behavior. This allows us to uncover the failure modes that prevent current methods from explaining successfully, i.e., the information constructs that are missing for any given XAI method, and whose inclusion can decrease the likelihood of misunderstanding AI behavior.
Nov-14-2023
- Country:
- Oceania > Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America
- United States
- Maryland (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Connecticut > New Haven County
- New Haven (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Canada > Alberta
- Europe
- France (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Sweden > Stockholm
- Stockholm (0.04)
- Italy
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > China
- Hong Kong (0.04)
- Genre:
- Overview (0.68)
- Industry:
- Transportation (0.47)
- Health & Medicine (0.46)
- Law (0.46)
- Technology:
- Information Technology
- Human Computer Interaction > Interfaces (0.92)
- Artificial Intelligence
- Robots (1.00)
- Representation & Reasoning (1.00)
- Natural Language > Explanation & Argumentation (1.00)
- Cognitive Science (1.00)
- Issues > Social & Ethical Issues (0.68)
- Machine Learning > Neural Networks
- Deep Learning (1.00)