Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance
Neural Information Processing Systems
Interpretability methods are valuable only if their explanations faithfully describe the explained model. In this work, we consider neural networks whose predictions are invariant under a specific symmetry group. This includes popular architectures, ranging from convolutional to graph neural networks. Any explanation that faithfully describes such a model must itself respect this invariance property.
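The abstract's criterion can be sketched as a simple check: an explanation is invariant if it is unchanged when the input is transformed by any element of the model's symmetry group. The function and toy model below are illustrative assumptions, not the paper's actual metric or implementation.

```python
import numpy as np

def explanation_invariance(explain, model, x, group_actions):
    """Hypothetical invariance score: the fraction of group actions g for
    which the explanation of the transformed input g(x) matches the
    explanation of the original input x."""
    e_ref = explain(model, x)
    matches = []
    for g in group_actions:
        e_g = explain(model, g(x))
        matches.append(float(np.allclose(e_g, e_ref, atol=1e-6)))
    return float(np.mean(matches))

# Toy example: a model whose prediction is invariant to feature
# permutations, explained by a uniform (hence permutation-invariant)
# attribution method.
model = lambda x: x.sum()                               # permutation-invariant prediction
explain = lambda m, x: np.full_like(x, m(x) / x.size)   # uniform attribution

x = np.array([1.0, 2.0, 3.0])
perms = [lambda v: v[[1, 0, 2]], lambda v: v[[2, 1, 0]]]
score = explanation_invariance(explain, model, x, perms)  # → 1.0
```

A non-invariant explanation method (e.g., one that attributes everything to the first feature) would score below 1.0 here, flagging a mismatch with the model's symmetry.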