How good is PAC-Bayes at explaining generalisation?

Antoine Picard-Weibel, Eugenio Clerico, Roman Moscoviz, Benjamin Guedj

arXiv.org Machine Learning 

The widespread use of modern neural networks in high-stakes applications requires safety guarantees on their performance on future data, which have not been observed during training [Xu and Goodacre, 2018, Russell and Norvig, 2020]. A well-established approach to training a predictor and evaluating its performance consists of the following steps. First, the available data are split into a training set and a test set. The training data are used to construct the predictor, whose performance is then assessed on the test data (empirical test risk). Finally, concentration inequalities [Boucheron et al., 2013] are used to derive, from this finite-sample estimate, an upper bound on the model's expected performance over the data distribution (population risk) [Langford, 2005].