Fast Last-Iterate Convergence of SGD in the Smooth Interpolation Regime

Jun-13-2026, 09:46:50 GMT – Neural Information Processing Systems

We study population convergence guarantees of stochastic gradient descent (SGD) for smooth convex objectives in the interpolation regime, where the noise at the optimum is zero or near zero. The behavior of the last iterate of SGD in this setting, particularly with large (constant) stepsizes, has received growing attention in recent years because of its implications for training over-parameterized models, for analyzing forgetting in continual learning, and for understanding the convergence of the randomized Kaczmarz method for solving linear systems.
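To make the setting concrete, here is a minimal sketch (not taken from the paper) of constant-stepsize SGD on a consistent least-squares problem, a standard example of interpolation because every per-sample loss is zero at a common solution. The function names `sgd_last_iterate` and `loss`, the problem sizes, and the stepsize of 1.0 are illustrative assumptions. With unit-norm rows and stepsize 1, each SGD step coincides with a randomized Kaczmarz projection onto one equation.

```python
import numpy as np


def sgd_last_iterate(A, b, stepsize, num_steps, seed=0):
    """Constant-stepsize SGD on f(x) = mean_i 0.5 * (a_i @ x - b_i)**2.

    Returns the last iterate (no averaging). Hypothetical helper for illustration.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(num_steps):
        i = rng.integers(n)                  # sample one equation uniformly
        residual = A[i] @ x - b[i]           # scalar residual of equation i
        x = x - stepsize * residual * A[i]   # stochastic gradient step
    return x


def loss(A, b, x):
    """Objective averaged over all rows (uniform distribution over equations)."""
    return 0.5 * np.mean((A @ x - b) ** 2)


# Assumed toy problem: over-parameterized (n < d) and consistent, so the
# gradient noise at any solution is exactly zero.
rng = np.random.default_rng(0)
n, d = 20, 50
A = rng.standard_normal((n, d))
A /= np.linalg.norm(A, axis=1, keepdims=True)  # unit-norm rows: each f_i is 1-smooth
x_star = rng.standard_normal(d)
b = A @ x_star

initial_loss = loss(A, b, np.zeros(d))
x_last = sgd_last_iterate(A, b, stepsize=1.0, num_steps=5000)
final_loss = loss(A, b, x_last)
print(f"initial loss: {initial_loss:.3e}, last-iterate loss: {final_loss:.3e}")
```

In this noiseless toy problem the last iterate's loss shrinks toward zero even though the stepsize is never decayed. The sketch only illustrates the regime and does not reproduce the rates analyzed in the paper.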

artificial intelligence, machine learning, proceedings

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.62)