Optimization and Bayes: A Trade-off for Overparameterized Neural Networks
–Neural Information Processing Systems
KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior.
Neural Information Processing Systems
Feb-9-2026, 11:29:06 GMT
- Country:
- North America > United States
- Maryland > Prince George's County > College Park (0.14)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
- Jordan (0.04)
- North America > United States