High-dimensional limit theorems for SGD: Momentum and Adaptive Step-sizes
