Last iterate convergence of SGD for Least-Squares in the Interpolation regime

Neural Information Processing Systems 

Hence the main question: how can SGD benefit from this noiseless model?