How To Make the Gradients Small Stochastically: Even Faster Convex and Nonconvex SGD
Neural Information Processing Systems
However, in terms of making the gradients small, the original SGD does not give an optimal rate, even when f(x) is convex. If f(x) is convex, to find a point with gradient norm at most ε, we design an algorithm SGD3 with a near-optimal rate Õ(ε^{-2}), improving the best known rate O(ε^{-8/3}) of [17].
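To make the "gradient norm at most ε" stopping criterion concrete, here is a minimal sketch of plain SGD (not the paper's SGD3, whose details are not given here) that runs until the stochastic gradient estimate falls below ε. The objective, step size, and noise model are illustrative assumptions:

```python
import numpy as np

def sgd_until_small_gradient(grad_oracle, x0, lr, eps, max_iters=100_000, seed=0):
    """Run vanilla SGD until the (noisy) gradient estimate has norm <= eps.

    Returns the final iterate and the number of steps taken. This is an
    illustrative stopping rule, not the SGD3 algorithm from the paper.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for t in range(max_iters):
        g = grad_oracle(x, rng)
        if np.linalg.norm(g) <= eps:
            return x, t
        x = x - lr * g  # standard SGD update
    return x, max_iters

# Toy convex objective f(x) = 0.5 * ||x||^2, whose true gradient is x,
# observed through additive Gaussian noise (a hypothetical noise model).
def noisy_grad(x, rng, sigma=0.01):
    return x + sigma * rng.standard_normal(x.shape)

x_final, iters = sgd_until_small_gradient(noisy_grad, np.ones(5), lr=0.1, eps=0.1)
```

On this toy quadratic, the iterate contracts toward the minimizer at the origin, so the stopping condition triggers after a modest number of steps; the point of the paper is that for general convex f, guaranteeing a small gradient (rather than a small function-value gap) requires a more careful analysis and algorithm.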