Neural Information Processing Systems
We prove the $O(t^{-1/2})$ rate of convergence for the squared norm of the gradient of the Moreau envelope, which is the standard stationarity measure for this class of problems. It matches the known rates that adaptive algorithms enjoy for the specific case of unconstrained smooth nonconvex stochastic optimization.
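To make the stationarity measure concrete, here is a minimal sketch of the Moreau envelope and its gradient for a toy one-dimensional objective. It assumes the unconstrained function f(x) = |x|, which is convex (and therefore weakly convex) and has a closed-form proximal map; this is not the setting or the algorithm analyzed in the paper. The function names and the smoothing parameter `lam` are hypothetical and chosen only for illustration.

```python
import math


def prox_abs(x: float, lam: float) -> float:
    """Proximal point of lam * |.| at x, i.e. soft-thresholding (toy example)."""
    return math.copysign(max(abs(x) - lam, 0.0), x)


def moreau_envelope_abs(x: float, lam: float) -> float:
    """Moreau envelope f_lam(x) = min_y { |y| + (y - x)^2 / (2 * lam) }."""
    y = prox_abs(x, lam)  # the minimizer of the inner problem
    return abs(y) + (y - x) ** 2 / (2.0 * lam)


def moreau_grad_abs(x: float, lam: float) -> float:
    """Gradient of the envelope: (x - prox(x)) / lam."""
    return (x - prox_abs(x, lam)) / lam


def stationarity(x: float, lam: float) -> float:
    """Squared norm of the envelope gradient, the measure named in the abstract."""
    return moreau_grad_abs(x, lam) ** 2


if __name__ == "__main__":
    for x in (2.0, 0.2, 0.0):
        print(x, moreau_envelope_abs(x, 0.5), stationarity(x, 0.5))
```

The measure vanishes at the minimizer x = 0 and stays bounded elsewhere even though |x| itself is not differentiable at 0, which is why the envelope gradient is a convenient surrogate when the objective is nonsmooth.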
Feb-9-2026, 10:23:37 GMT