Sparks of Explainability: Recent Advancements in Explaining Large Vision Models
arXiv.org Artificial Intelligence
This thesis explores advanced approaches to improving explainability in computer vision by analyzing and modeling the features exploited by deep neural networks. It first evaluates attribution methods, notably saliency maps, introducing a metric based on algorithmic stability and an approach based on Sobol indices that uses quasi-Monte Carlo sequences to significantly reduce computation time. In addition, the EVA method offers a first formulation of attribution with formal guarantees via verified perturbation analysis. Experimental results indicate that in complex scenarios these methods do not provide sufficient understanding, particularly because they identify only "where" the model focuses without clarifying "what" it perceives. Two hypotheses are therefore examined. The first is aligning models with human reasoning, through a training routine that integrates the imitation of human explanations and optimization within the space of 1-Lipschitz functions. The second is adopting a concept-based explainability approach. The CRAFT method is proposed to automate the extraction of the concepts used by the model and to assess their importance; it is complemented by MACO, which enables their visualization. These works converge towards a unified framework, illustrated by an interactive demonstration applied to the 1000 ImageNet classes in a ResNet model.
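To make the Sobol-index idea in the abstract concrete, here is a minimal sketch, not the thesis's implementation. It estimates total-order Sobol indices for a toy "model" whose inputs are perturbation masks over four image regions. The assumptions are loud ones: the Halton sequence stands in for whichever quasi-Monte Carlo sequence the thesis uses, the Jansen estimator is one common choice for total-order indices, and `toy_model`, its weights, and all function names are hypothetical.

```python
from typing import Callable, List, Sequence

# First primes, used as Halton bases (one base per dimension).
PRIMES = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37]


def radical_inverse(index: int, base: int) -> float:
    """Van der Corput radical inverse of `index` in `base`, a value in [0, 1)."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        result += (index % base) * fraction
        index //= base
        fraction /= base
    return result


def halton(n_points: int, dim: int) -> List[List[float]]:
    """First `n_points` Halton points in `dim` dimensions (index 0 is skipped)."""
    if dim > len(PRIMES):
        raise ValueError("not enough prime bases for this dimension")
    return [[radical_inverse(i, PRIMES[j]) for j in range(dim)]
            for i in range(1, n_points + 1)]


def total_sobol_indices(model: Callable[[Sequence[float]], float],
                        n_regions: int,
                        n_points: int = 1024) -> List[float]:
    """Jansen estimator of total-order Sobol indices, one per mask region.

    Each quasi-random point of dimension 2 * n_regions is split into two
    mask samples A and B. For region i, A is re-evaluated with only its
    i-th entry swapped for B's, and the squared output change is averaged.
    """
    points = halton(n_points, 2 * n_regions)
    a = [p[:n_regions] for p in points]
    b = [p[n_regions:] for p in points]
    f_a = [model(row) for row in a]
    mean = sum(f_a) / n_points
    variance = sum((y - mean) ** 2 for y in f_a) / n_points
    if variance == 0.0:
        return [0.0] * n_regions  # constant model: no region matters
    indices = []
    for i in range(n_regions):
        acc = 0.0
        for row_a, row_b, y in zip(a, b, f_a):
            mixed = list(row_a)
            mixed[i] = row_b[i]  # perturb region i only
            acc += (y - model(mixed)) ** 2
        indices.append(acc / (2 * n_points * variance))
    return indices


def toy_model(mask: Sequence[float]) -> float:
    """Hypothetical stand-in for a classifier score under a region mask."""
    weights = (3.0, 1.0, 0.0, 0.5)  # region 2 is ignored by the "model"
    return sum(w * m for w, m in zip(weights, mask))


if __name__ == "__main__":
    for region, score in enumerate(total_sobol_indices(toy_model, 4)):
        print(f"region {region}: total Sobol index ~ {score:.3f}")
```

For this linear toy model with independent uniform masks, the exact total-order index of region i is w_i^2 divided by the sum of squared weights, so region 0 should dominate and region 2 should score zero. In an attribution setting, the scores would be reshaped into a coarse importance map over the image, which is the "where" information the abstract describes.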
Feb-2-2025
- Country:
  - Asia > Russia (0.04)
  - South America > Argentina
    - Patagonia > Tierra del Fuego Province > Ushuaia (0.04)
  - North America
    - United States > Illinois
      - Cook County > Chicago (0.04)
    - Canada > Quebec
      - Montreal (0.04)
  - Europe
- Genre:
  - Workflow (1.00)
  - Overview (1.00)
  - Instructional Material (1.00)
  - Research Report
    - Promising Solution (1.00)
    - New Finding (1.00)
    - Experimental Study (1.00)
- Industry:
  - Information Technology > Security & Privacy (1.00)
  - Government > Regional Government (1.00)
  - Law (0.92)
  - Education (0.67)
  - Transportation > Ground
    - Road (0.67)
  - Health & Medicine > Therapeutic Area
    - Neurology (0.92)
- Technology:
  - Information Technology
    - Sensing and Signal Processing > Image Processing (1.00)
    - Data Science > Data Mining (1.00)
    - Artificial Intelligence
      - Vision (1.00)
      - Natural Language (1.00)
      - Cognitive Science (1.00)
      - Issues > Social & Ethical Issues (0.93)
      - Representation & Reasoning
        - Search (1.00)
        - Optimization (1.00)
        - Mathematical & Statistical Methods (0.92)
      - Machine Learning
        - Statistical Learning (1.00)
        - Neural Networks > Deep Learning (1.00)