Reviews: How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD
–Neural Information Processing Systems
This work studies convergence rates of the gradient norm for convex composite objectives by combining Nesterov's acceleration techniques for gradient descent with SGD. The authors propose three approaches that differ from one another only slightly, and they provide convergence rates for each. My comments on this work are as follows:

1. It is indeed important to study convergence rates of gradients, especially for non-convex problems. The authors motivate the reader by mentioning this, but they assume convexity in their problem set-up.
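To make the setting under review concrete, here is a minimal sketch (not the paper's algorithm) of stochastic updates with Nesterov-style momentum on a convex composite objective, tracking how small the gradient mapping becomes. All hyperparameters and the problem instance are illustrative assumptions:

```python
import numpy as np

# Illustrative sketch only: proximal SGD with Nesterov-style momentum on a
# convex composite objective F(x) = f(x) + psi(x), where
#   f(x)   = (1 / (2n)) * ||A x - b||^2   (smooth convex part, sampled row-wise)
#   psi(x) = lam * ||x||_1                (nonsmooth convex part, handled via prox)
# This is NOT the reviewed paper's method; it only illustrates the setting the
# review describes: measuring how fast gradients (here, the gradient mapping)
# become small under stochastic updates.

rng = np.random.default_rng(0)
n, d = 200, 20                      # hypothetical problem size
A = rng.standard_normal((n, d))
x_true = rng.standard_normal(d)
b = A @ x_true
lam, eta, beta = 0.01, 1e-3, 0.9    # hypothetical hyperparameters

def prox_l1(v, t):
    """Soft-thresholding: proximal operator of t * lam * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t * lam, 0.0)

x = np.zeros(d)
v = np.zeros(d)                     # momentum buffer
for it in range(5000):
    i = rng.integers(n)                      # sample one row: stochastic gradient
    lookahead = x + beta * v                 # Nesterov look-ahead point
    g = A[i] * (A[i] @ lookahead - b[i])     # unbiased estimate of grad f
    v = beta * v - eta * g
    x = prox_l1(x + v, eta)                  # proximal step for the l1 term

# Gradient mapping norm: a standard stationarity measure for composite problems
full_g = A.T @ (A @ x - b) / n
grad_map = (x - prox_l1(x - eta * full_g, eta)) / eta
print(f"||gradient mapping|| = {np.linalg.norm(grad_map):.4f}")
```

The gradient-mapping norm reported at the end is the natural analogue of the gradient norm for composite problems, which is the quantity whose convergence rate the paper's analysis targets.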
Oct-7-2024, 21:00:02 GMT