Causal Explanations for Image Classifiers

Chockler, Hana, Kelly, David A., Kroening, Daniel, Sun, Youcheng

Nov-13-2024–arXiv.org Artificial Intelligence

Existing algorithms for explaining the output of image classifiers use different definitions of explanations and a variety of techniques to extract them. However, none of the existing tools use a principled approach based on formal definitions of causes and explanations for the explanation extraction. In this paper we present a novel black-box approach to computing explanations grounded in the theory of actual causality. We prove relevant theoretical results and present an algorithm for computing approximate explanations based on these definitions. We prove termination of our algorithm and discuss its complexity and the amount of approximation compared to the precise definition. We implemented the framework in a tool rex and we present experimental results and a comparison with state-of-the-art tools. We demonstrate that rex is the most efficient tool and produces the smallest explanations, in addition to outperforming other black-box tools on standard quality measures.

explanation, responsibility, superpixel, (14 more...)

arXiv.org Artificial Intelligence

Nov-13-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Minnesota (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe > United Kingdom
  - England
    - Oxfordshire > Oxford (0.14)
    - Greater Manchester > Manchester (0.04)
    - Greater London > London (0.04)
- Asia > Middle East
  - Jordan (0.04)
  - UAE (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Information Technology (0.67)
- Transportation (0.55)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found