Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Open in new window