Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

Neural Information Processing Systems 

KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found