Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Sacha, Mikołaj, Jura, Bartosz, Rymarczyk, Dawid, Struski, Łukasz, Tabor, Jacek, Zieliński, Bartosz

Aug-16-2023–arXiv.org Artificial Intelligence

Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Because each location in that layer has a large receptive field, a prototype's activation often depends on parts of the image outside the highlighted activation region, which can lead to misleading interpretations. We call this undesired behavior spatial explanation misalignment and introduce an interpretability benchmark with a set of dedicated metrics for quantifying it. In addition, we propose a method for misalignment compensation and apply it to existing state-of-the-art models. We show the expressiveness of our benchmark and the effectiveness of the proposed compensation methodology through extensive empirical studies.
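
To illustrate why a similarity map computed in a late layer can be spatially misleading, the following is a minimal sketch using standard receptive-field arithmetic for stacked convolutions. It is not taken from the paper: the function name `receptive_field` and the layer configuration `HYPOTHETICAL_LAYERS` are assumptions made for illustration, and padding and dilation are ignored.

```python
def receptive_field(layers):
    """Return (receptive_field_size, cumulative_stride) along one axis.

    layers: iterable of (kernel_size, stride) pairs, ordered input to output.
    Padding and dilation are ignored in this simplified sketch.
    """
    size, jump = 1, 1  # one input pixel sees itself; adjacent pixels are 1 apart
    for kernel, stride in layers:
        size += (kernel - 1) * jump  # each layer widens the field by (k - 1) * jump
        jump *= stride               # strides multiply the spacing between cells
    return size, jump


# Hypothetical backbone, chosen only for illustration (not a model from the paper).
HYPOTHETICAL_LAYERS = [(7, 2), (3, 2), (3, 1), (3, 1), (3, 2), (3, 1)]

rf_size, cell_size = receptive_field(HYPOTHETICAL_LAYERS)
print(f"each feature-map cell covers {cell_size} px when upsampled, "
      f"but depends on {rf_size} px of input")
```

In this toy configuration, a single cell of the final feature map is drawn over an 8-pixel-wide patch when the similarity map is upsampled, yet its value is a function of a 51-pixel-wide window. The gap between the two is the kind of discrepancy the abstract refers to as spatial misalignment.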

artificial intelligence, deep learning, machine learning


Country:
- North America > United States (0.04)
- Europe
  - Poland > Lesser Poland Province
    - Kraków (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)
- Asia > Middle East
  - Israel > Tel Aviv District > Tel Aviv (0.04)

Genre:
- Research Report (0.84)

Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.95)
  - Artificial Intelligence
    - Representation & Reasoning (0.93)
    - Vision (0.69)
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)
