CRAFT: Concept Recursive Activation FacTorization for Explainability
Fel, Thomas, Picard, Agustin, Bethune, Louis, Boissin, Thibaut, Vigouroux, David, Colin, Julien, Cadène, Rémi, Serre, Thomas
–arXiv.org Artificial Intelligence
Attribution methods, which employ heatmaps to identify the most influential regions of an image that impact model decisions, have gained widespread popularity as a type of explainability method. However, recent research has exposed the limited practical value of these methods, attributed in part to their narrow focus on the most prominent regions of an image -- revealing "where" the model looks, but failing to elucidate "what" the model sees in those areas. In this work, we try to fill in this gap with CRAFT -- a novel approach to identify both "what" and "where" by generating concept-based explanations. We introduce 3 new ingredients to the automatic concept extraction literature: (i) a recursive strategy to detect and decompose concepts across layers, (ii) a novel method for a more faithful estimation of concept importance using Sobol indices, and (iii) the use of implicit differentiation to unlock Concept Attribution Maps. We conduct both human and computer vision experiments to demonstrate the benefits of the proposed approach. We show that the proposed concept importance estimation technique is more faithful to the model than previous methods. When evaluating the usefulness of the method for human experimenters on a human-centered utility benchmark, we find that our approach significantly improves on two of the three test scenarios. Our code is freely available at github.com/deel-ai/Craft.
arXiv.org Artificial Intelligence
Mar-28-2023
- Country:
- Genre:
- Research Report
- Promising Solution (0.68)
- New Finding (0.46)
- Research Report
- Industry:
- Transportation (0.47)
- Law (0.46)
- Information Technology > Security & Privacy (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Representation & Reasoning (0.93)
- Issues > Social & Ethical Issues (0.66)
- Machine Learning
- Neural Networks > Deep Learning (0.67)
- Statistical Learning (0.67)
- Information Technology > Artificial Intelligence