A Rigorous Link between Deep Ensembles and (Variational) Bayesian Methods

Neural Information Processing Systems 

We establish the first mathematically rigorous link between Bayesian, variational Bayesian, and ensemble methods. A key step towards this it to reformulate the non-convex optimisation problem typically encountered in deep learning as a convex optimisation in the space of probability measures.