Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees

Vielhaben, Johanna, Blücher, Stefan, Strodthoff, Nils

Jun-18-2023–arXiv.org Artificial Intelligence

For the trustworthy application of XAI, in particular for high-stake decisions, a more global model understanding is required. To this end, concept-based methods have been proposed, which are however not guaranteed to be bound to the actual model reasoning. To circumvent this problem, we propose Multi-dimensional Concept Discovery (MCD) as an extension of previous approaches that fulfills a completeness relation on the level of concepts. Our method starts from general linear subspaces as concepts and does neither require reinforcing concept interpretability nor re-training of model parts. We propose sparse subspace clustering to discover improved concepts and fully leverage the potential of multi-dimensional subspaces. MCD offers two complementary analysis tools for concepts in input space: (1) concept activation maps, that show where a concept is expressed within a sample, allowing for concept characterization through prototypical samples, and (2) concept relevance heatmaps, that decompose the model decision into concept contributions. Both tools together enable a detailed global understanding of the model reasoning, which is guaranteed to relate to the model via a completeness relation. Thus, MCD paves the way towards more trustworthy concept-based XAI. We empirically demonstrate the superiority of MCD against more constrained concept definitions.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Jun-18-2023

arXiv.org PDF

Add feedback

Country:
- Europe > France (0.04)
- North America > United States
  - Pennsylvania (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (0.67)
  - Nuclear Medicine (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Vision (0.94)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning > Clustering (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found