Provable concept learning for interpretable predictions using variational autoencoders
Taeb, Armeen, Ruggeri, Nicolo, Schnuck, Carina, Yang, Fanny
–arXiv.org Artificial Intelligence
In safety-critical applications, practitioners are reluctant to trust neural networks when no interpretable explanations are available. Many attempts to provide such explanations revolve around pixel-based attributions or use previously known concepts. In this paper we aim to provide explanations by provably identifying \emph{high-level, previously unknown ground-truth concepts}. To this end, we propose a probabilistic modeling framework to derive (C)oncept (L)earning and (P)rediction (CLAP) -- a VAE-based classifier that uses visually interpretable concepts as predictors for a simple classifier. Assuming a generative model for the ground-truth concepts, we prove that CLAP is able to identify them while attaining optimal classification accuracy. Our experiments on synthetic datasets verify that CLAP identifies distinct ground-truth concepts on synthetic datasets and yields promising results on the medical Chest X-Ray dataset.
arXiv.org Artificial Intelligence
Jul-22-2022
- Country:
- Europe
- Switzerland > Zürich
- Zürich (0.14)
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.04)
- Switzerland > Zürich
- Europe
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.67)
- Technology: