Well File:

 Pascal Germain


PAC-Bayesian Theory Meets Bayesian Inference

Neural Information Processing Systems

That is, for the negative log-likelihood loss function, we show that the minimization of PAC-Bayesian generalization risk bounds maximizes the Bayesian marginal likelihood. This provides an alternative explanation to the Bayesian Occam's razor criteria, under the assumption that the data is generated by an i.i.d.