Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Guodong Zhang, Lala Li, Zachary Nado, James Martens, Sushant Sachdeva, George Dahl, Chris Shallue, Roger B. Grosse
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-13-2026, 17:43:18 GMT