PAC-Bayesian Theory Meets Bayesian Inference
–Neural Information Processing Systems
That is, for the negative log-likelihood loss function, we show that the minimization of PAC-Bayesian generalization bounds maximizes the Bayesian marginal likelihood. This provides an alternative explanation to the Bayesian Occam's razor criteria, under the assumption that the data is generated by an i.i.d.
Neural Information Processing Systems
Mar-17-2026, 09:29:20 GMT
- Technology: