Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization

Tan, Chengli, Zhou, Yubo, Ye, Haishan, Dai, Guang, Liu, Junmin, Song, Zengjie, Zhang, Jiangshe, Zhao, Zixiang, Hao, Yunda, Xu, Yong

Jun-2-2025–arXiv.org Artificial Intelligence

Deep neural networks have been increasingly used in safety-critical applications such as medical diagnosis and autonomous driving. However, many studies suggest that they are prone to being poorly calibrated and have a propensity for overconfidence, which may have disastrous consequences. In this paper, unlike standard training such as stochastic gradient descent, we show that the recently proposed sharpness-aware minimization (SAM) counteracts this tendency towards overconfidence. The theoretical analysis suggests that SAM allows us to learn models that are already well-calibrated by implicitly maximizing the entropy of the predictive distribution. Inspired by this finding, we further propose a variant of SAM, coined as CSAM, to ameliorate model calibration. Extensive experiments on various datasets, including ImageNet-1K, demonstrate the benefits of SAM in reducing calibration error. Meanwhile, CSAM performs even better than SAM and consistently achieves lower calibration error than other approaches

artificial intelligence, calibration, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Jun-2-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.84)

Industry:
- Health & Medicine (0.48)
- Information Technology > Robotics & Automation (0.34)
- Automobiles & Trucks (0.34)
- Transportation > Ground
  - Road (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (0.54)
  - Neural Networks > Deep Learning (0.49)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found