AITopics | calibration method

calibration method

CalArena: A Large-Scale Post-Hoc Calibration Benchmark

Berta, Eugène, Holzmüller, David, Bach, Francis, Jordan, Michael I.

arXiv.org Machine Learning, May-29-2026

Reliable probability estimates are critical in many machine learning applications, yet modern classifiers are often poorly calibrated. Post-hoc calibration provides a simple and widely used solution, but the large number of proposed methods, combined with small-scale and inconsistent evaluations, makes it difficult to determine which approaches are truly effective in practice. We introduce a large-scale, standardized benchmark for post-hoc calibration, covering nearly 2000 experiments across tabular and computer vision tasks, including binary, multiclass, and large-scale classification settings. Our benchmark aggregates predictions from a diverse set of classical models, modern deep learning architectures, and foundation models, and provides unified, reproducible implementations of dozens of calibration methods within a common evaluation framework. We argue that Post-Hoc Improvement (PHI) in proper scoring rules offers a principled alternative to traditional calibration error estimators for comparing post-hoc methods, capturing both calibration quality and potential degradation to the model's predictive performance. Using this framework, we conduct the most comprehensive empirical study of post-hoc calibration to date. Our results reveal consistent patterns across domains: smooth calibration functions outperform binning-based approaches, dedicated multiclass methods are essential in high-dimensional settings, and generic machine learning models are not competitive without calibration-specific design. To facilitate future research, we release all data, code, and evaluation tools, providing a plug-and-play benchmark for developing and comparing calibration methods.
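As a rough illustration of the idea of scoring a post-hoc method by its improvement in a proper scoring rule, the sketch below compares held-out log loss before and after temperature scaling on a binary problem. It is a minimal sketch under stated assumptions, not the benchmark's implementation: the abstract does not define PHI precisely, so the improvement is taken here to be the plain difference in log loss, and the function names (`fit_temperature`, `post_hoc_improvement`) and the grid of temperatures are hypothetical.

```python
import math


def sigmoid(z):
    """Map a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def log_loss(probs, labels):
    """Mean negative log-likelihood for binary labels (a proper scoring rule)."""
    eps = 1e-12
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total -= math.log(p) if y == 1 else math.log(1.0 - p)
    return total / len(labels)


def fit_temperature(logits, labels, grid=None):
    """Grid-search the temperature that minimizes log loss on a calibration set."""
    grid = grid or [0.25 * k for k in range(1, 41)]  # assumed grid: 0.25 .. 10.0
    return min(grid, key=lambda t: log_loss([sigmoid(z / t) for z in logits], labels))


def post_hoc_improvement(logits_cal, y_cal, logits_test, y_test):
    """Assumed definition: log loss before calibration minus log loss after."""
    t = fit_temperature(logits_cal, y_cal)
    before = log_loss([sigmoid(z) for z in logits_test], y_test)
    after = log_loss([sigmoid(z / t) for z in logits_test], y_test)
    return before - after, t
```

A positive value means the calibrator improved the held-out score; a negative value means it degraded the model's predictions, which a calibration error estimator alone may not reveal.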

artificial intelligence, calibration, machine learning, (16 more...)

arXiv.org Machine Learning

2605.30188

Country: Europe (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (0.67)
Leisure & Entertainment > Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift

Dong, Jinzong, Jiang, Zhaohui, Yang, Bo

arXiv.org Machine Learning, May-22-2026

Confidence calibration for classification models is vital in safety-critical decision-making scenarios and has received extensive attention. General confidence calibration methods assume that training and test data are independent and identically distributed, which limits their effectiveness under covariate shift. Previous calibration methods for covariate shift struggle with class-wise or canonical calibration and often rely on importance weighting, which is unstable when density ratios are large or unbounded. Given these limitations, this paper rethinks confidence calibration under covariate shift. First, we derive a necessary and sufficient condition for confidence calibration under covariate shift, named the expectation consistency condition, which reveals that covariate shift does not necessarily lead to uncalibrated confidence and provides a weaker condition for confidence calibration than global alignment of the covariate distributions. Second, using the expectation consistency condition, we propose an unsupervised domain adaptation loss for calibrating confidence on the target domain, named the Expectation Consistency Loss (ECL), which is compatible with canonical, class-wise, and top-label calibration. Third, we prove that computing ECL has the same sample complexity as the Expected Calibration Error (ECE) and provide a theoretically grounded mini-batch training scheme for ECL. Finally, we validate the effectiveness of our method on both simulated and real-world covariate shift datasets.
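For readers unfamiliar with the Expected Calibration Error mentioned above, the following is a minimal sketch of the common binned top-label estimator. It is not the paper's ECL and not its code; the function name, the equal-width bins, and the default of 10 bins are assumptions chosen for illustration.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned top-label ECE: weighted mean of |accuracy - mean confidence| per bin.

    confidences: predicted probability of the predicted class, each in [0, 1]
    correct:     1 if the prediction was right, 0 otherwise
    """
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for c, ok in zip(confidences, correct):
        # Equal-width bins [k/n_bins, (k+1)/n_bins); a confidence of 1.0 goes in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx].append((c, ok))

    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(ok for _, ok in members) / len(members)
        ece += (len(members) / n) * abs(accuracy - avg_conf)
    return ece
```

For example, ten predictions made at confidence 0.9 of which only five are correct give an ECE of about 0.4, while nine correct out of ten give an ECE of about 0.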

artificial intelligence, calibration, machine learning, (18 more...)

arXiv.org Machine Learning

2605.21552

Country: Asia > China (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Confidence Calibration of Classifiers with Many Classes

Neural Information Processing Systems, Mar-21-2026, 13:49:04 GMT

For classification models based on neural networks, the maximum predicted class probability is often used as a confidence score. This score rarely predicts the probability of a correct prediction well, so a post-processing calibration step is required. However, many confidence calibration methods fail for problems with many classes. To address this issue, we transform the problem of calibrating a multiclass classifier into calibrating a single surrogate binary classifier. This approach allows for more efficient use of standard calibration methods. We evaluate our approach on numerous neural networks used for image or text classification and show that it significantly enhances existing calibration methods.
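One simple way to picture a reduction from multiclass confidence calibration to a binary problem is sketched below: each prediction becomes a pair (confidence, was the top label correct?), which any binary calibrator can then be fit on. This is an assumption-laden illustration only; the abstract does not describe the paper's actual surrogate construction, which may differ, and the function name `to_surrogate_binary` is hypothetical.

```python
def to_surrogate_binary(prob_rows, labels):
    """Turn multiclass predictions into (confidence, correct) pairs.

    prob_rows: list of per-class probability lists, one per example
    labels:    list of true class indices
    Returns a list of (max probability, 1 if the argmax equals the label else 0).
    """
    pairs = []
    for probs, y in zip(prob_rows, labels):
        # Predicted class is the index of the largest probability.
        pred = max(range(len(probs)), key=probs.__getitem__)
        pairs.append((probs[pred], 1 if pred == y else 0))
    return pairs
```

The resulting pairs have the same shape regardless of the number of classes, which is what lets a standard binary method be reused unchanged.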

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)

A PID Controller Approach for Adaptive Probability-dependent Gradient Decay in Model Calibration

Zhang, Siyuan (School of Internet of Things Engineering, Jiangnan University, Wuxi, China 214122), Xie, Linbo

Neural Information Processing Systems, Feb-18-2026, 05:42:01 GMT

The implementation code is available at https://github.com/UHIF/PID_AGD.

calibration, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Asia > China (0.40)
Europe > France (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Smart Houses & Appliances (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

d826f5aadb26db488b8686097ceea2d1-Paper-Conference.pdf

Neural Information Processing Systems, Feb-17-2026, 09:48:49 GMT

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Europe > France (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Confidence Calibration of Classifiers with Many Classes

Neural Information Processing Systems, Feb-16-2026, 13:39:21 GMT

When such components are expected to be embedded in safety-critical systems (e.g.,

calibration, large language model, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Europe > France (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report > Experimental Study (0.92)
Overview (0.67)

Industry: Energy (0.42)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Towards Improving Calibration in Object Detection Under Domain Shift

Neural Information Processing Systems, Feb-13-2026, 02:27:33 GMT

Unfortunately, very little to no attention is paid towards addressing calibration of DNN-based visual object detectors, which occupy similar space and importance in many decision-making systems as their visual classification counterparts. In this work, we study the calibration of DNN-based object detection models, particularly under domain shift.

artificial intelligence, calibration, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > Pakistan > Punjab > Lahore Division > Lahore (0.04)

Technology: