Confidence Estimation via Auxiliary Models
Charles Corbière, Nicolas Thome, Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
Reliably quantifying the confidence of deep neural classifiers is a challenging yet fundamental requirement for deploying such models in safety-critical applications. In this paper, we introduce a novel target criterion for model confidence, namely the true class probability (TCP). We show that TCP offers better properties for confidence estimation than the standard maximum class probability (MCP). Since the true class is by nature unknown at test time, we propose to learn the TCP criterion from data with an auxiliary model, introducing a specific learning scheme adapted to this context. We evaluate our approach on two tasks, failure prediction and self-training with pseudo-labels for domain adaptation, both of which require effective confidence estimates. Extensive experiments validate the relevance of the proposed approach on each task. We study various network architectures and experiment with small and large datasets for image classification and semantic segmentation. On every benchmark tested, our approach outperforms strong baselines.
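To make the difference between the two criteria named in the abstract concrete, the following minimal sketch computes both for a single example. It assumes that class probabilities come from a softmax over logits; the function names `softmax`, `mcp`, and `tcp` and the example logits are hypothetical and chosen only for illustration. It is not the authors' implementation, and it does not cover the auxiliary model that learns to estimate TCP when the label is unavailable.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mcp(probs):
    """Maximum class probability: the probability of the predicted class."""
    return max(probs)


def tcp(probs, true_class):
    """True class probability: the probability assigned to the correct class.

    This needs the ground-truth label, so it can only be computed directly
    on labeled data (for example, the training set).
    """
    return probs[true_class]


# Hypothetical 3-class example in which the model predicts class 0.
logits = [2.0, 1.5, 0.1]
probs = softmax(logits)

# If the true class is 1, the prediction is wrong: MCP stays high,
# while TCP is strictly lower because it reads the true class entry.
mcp_value = mcp(probs)
tcp_if_wrong = tcp(probs, true_class=1)

# If the true class is 0, the prediction is right and TCP equals MCP.
tcp_if_right = tcp(probs, true_class=0)
```

When the prediction is correct, the two values coincide; when it is wrong, TCP falls below MCP, which is the behavior that makes it a useful target for detecting failures.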
Dec-11-2020
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Automobiles & Trucks (0.46)
- Health & Medicine > Diagnostic Medicine (0.46)
- Information Technology (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.46)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (1.00)
- Natural Language (0.93)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Sensing and Signal Processing > Image Processing (1.00)