Disentangling the Roles of Curation, Data-Augmentation and the Prior in the Cold Posterior Effect Kevin Roth

May-29-2025, 02:57:16 GMT–Neural Information Processing Systems

The "cold posterior effect" (CPE) in Bayesian deep learning describes the uncomforting observation that the predictive performance of Bayesian neural networks can be significantly improved if the Bayes posterior is artificially sharpened using a temperature parameter T < 1. The CPE is problematic in theory and practice and since the effect was identified many researchers have proposed hypotheses to explain the phenomenon. However, despite this intensive research effort the effect remains poorly understood. In this work we provide novel and nuanced evidence relevant to existing explanations for the cold posterior effect, disentangling three hypotheses: 1. The dataset curation hypothesis of Aitchison (2020): we show empirically that the CPE does not arise in a real curated data set but can be produced in a controlled experiment with varying curation strength.

artificial intelligence, data augmentation, machine learning, (15 more...)

Neural Information Processing Systems

May-29-2025, 02:57:16 GMT

Conferences PDF

Add feedback

Country:
- Europe (0.14)
- North America > Canada
  - Ontario > Toronto (0.14)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.68)
    - Neural Networks > Deep Learning (0.91)
    - Statistical Learning (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.94)