PAC-Bayesian Theory Meets Bayesian Inference
Pascal Germain, Francis Bach, Alexandre Lacoste, Simon Lacoste-Julien
–Neural Information Processing Systems
That is, for the negative log-likelihood loss function, we show that the minimization of PAC-Bayesian generalization risk bounds maximizes the Bayesian marginal likelihood. This provides an alternative explanation to the Bayesian Occam's razor criteria, under the assumption that the data is generated by an i.i.d.
Neural Information Processing Systems
Jan-20-2025, 14:46:04 GMT