AdaGrad stepsizes: Sharp convergence over nonconvex landscapes, from any initialization

Open in new window