Typicalness-Aware Learning for Failure Detection

May-26-2025, 17:02:10 GMT–Neural Information Processing Systems

Deep neural networks (DNNs) often suffer from overconfidence: they make incorrect predictions with high confidence scores, which hinders their application in critical systems. In this paper, we propose a novel approach called Typicalness-Aware Learning (TAL) to address this issue and improve failure detection performance. We observe that, with the cross-entropy loss, model predictions are optimized to align with the corresponding labels by either increasing the logit magnitude or refining the logit direction. For atypical samples, however, the image content and the label may not match well. This discrepancy can lead to overfitting on atypical samples, ultimately resulting in the overconfidence issue. We therefore devise a metric that quantifies the typicalness of each sample and use it to dynamically adjust the logit magnitude during training. Because relatively atypical samples can then be adequately fitted while a reliable logit direction is preserved, overconfidence can be mitigated.
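The split between logit magnitude and logit direction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the typicalness metric or the adjustment rule, so the score tau in [0, 1], the linear mapping in dynamic_magnitude, and the bounds t_min and t_max are all hypothetical choices made for illustration. The sketch only shows that rescaling the magnitude changes the softmax confidence while leaving the direction, and hence the predicted class, unchanged.

```python
import math


def decompose(logits):
    """Split a logit vector into its L2 magnitude and unit direction."""
    magnitude = math.sqrt(sum(z * z for z in logits))
    if magnitude == 0.0:
        raise ValueError("a zero logit vector has no direction")
    return magnitude, [z / magnitude for z in logits]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_magnitude(tau, t_min=1.0, t_max=10.0):
    """Hypothetical mapping from a typicalness score to a logit magnitude.

    Assumption (not from the source): tau = 1 means fully typical, and
    less typical samples are assigned a larger magnitude by linear
    interpolation between t_min and t_max.
    """
    if not 0.0 <= tau <= 1.0:
        raise ValueError("tau must lie in [0, 1]")
    return t_min + (1.0 - tau) * (t_max - t_min)


def rescale_logits(logits, tau, t_min=1.0, t_max=10.0):
    """Keep the logit direction and replace the magnitude with the
    typicalness-dependent value from dynamic_magnitude."""
    _, direction = decompose(logits)
    target = dynamic_magnitude(tau, t_min, t_max)
    return [target * d for d in direction]


if __name__ == "__main__":
    raw = [3.0, 4.0]
    for tau in (1.0, 0.5, 0.0):
        scaled = rescale_logits(raw, tau)
        print(tau, scaled, max(softmax(scaled)))
```

In this sketch the predicted class is the same for every value of tau, because only the length of the logit vector changes; the confidence assigned to that class grows with the magnitude.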

artificial intelligence, machine learning, typicalness-aware learning

Genre:
- Research Report (0.42)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.62)