Towards a Unified Framework for Evaluating Explanations
The challenge of creating interpretable models has been taken up by two main research communities: ML researchers, who have primarily focused on lower-level explainability methods suited to the needs of engineers, and HCI researchers, who have emphasized user-centered approaches, often based on participatory design methods. This paper reviews how these communities have evaluated interpretability, identifying overlaps and semantic misalignments. We propose moving towards a unified framework of evaluation criteria and lay the groundwork for such a framework by articulating the relationships between existing criteria. We argue that explanations serve as mediators between models and stakeholders, whether for intrinsically interpretable models or for opaque black-box models analyzed via post-hoc techniques. We further argue that useful explanations require both faithfulness and intelligibility: explanation plausibility is a prerequisite for intelligibility, while stability is a prerequisite for faithfulness. We illustrate these criteria, along with specific evaluation methods, using examples from an ongoing study of an interpretable neural network for predicting a particular learner behavior.
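As a rough illustration of the kinds of evaluation methods the abstract alludes to, the sketch below shows two checks commonly used in the explainability literature: a perturbation-based stability score (stability as a prerequisite for faithfulness) and a deletion-style faithfulness curve. This is a minimal sketch, not the authors' implementation; the `model` and `explain_fn` callables, their signatures, and all parameter defaults are assumptions for illustration.

```python
import numpy as np

def explanation_stability(explain_fn, model, x, noise_scale=0.01, n_samples=20, rng=None):
    """Average cosine similarity between the attribution for x and attributions
    for slightly perturbed copies of x. Values near 1.0 suggest the explanation
    is stable under small input changes (hypothetical explain_fn/model API)."""
    rng = rng or np.random.default_rng(0)
    base = explain_fn(model, x).ravel()
    sims = []
    for _ in range(n_samples):
        x_pert = x + rng.normal(scale=noise_scale, size=x.shape)
        pert = explain_fn(model, x_pert).ravel()
        denom = np.linalg.norm(base) * np.linalg.norm(pert)
        sims.append(float(base @ pert / denom) if denom else 0.0)
    return float(np.mean(sims))

def deletion_faithfulness(model, x, attribution, n_steps=10, baseline=0.0):
    """Deletion-style check: zero out the most-attributed features first and
    record how the model's score drops. A faster drop suggests the attribution
    tracks features the model actually relies on (model assumed to return a
    scalar score for a single input)."""
    order = np.argsort(-np.abs(attribution.ravel()))
    x_work = x.ravel().copy()
    scores = [float(model(x_work.reshape(x.shape)))]
    step = max(1, len(order) // n_steps)
    for i in range(0, len(order), step):
        x_work[order[i:i + step]] = baseline  # "delete" the top features
        scores.append(float(model(x_work.reshape(x.shape))))
    return scores
```

In practice, stability scores are compared across explanation methods for the same model, while deletion curves are summarized (e.g., by their area) to compare how faithfully each method reflects the model's actual reliance on input features.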
arXiv.org Artificial Intelligence
Jul-13-2024