Momentum-Based Variance Reduction in Non-Convex SGD
Ashok Cutkosky, Francesco Orabona
Neural Information Processing Systems
Variance reduction has emerged in recent years as a strong competitor to stochastic gradient descent in non-convex problems, providing the first algorithms to improve upon the convergence rate of stochastic gradient descent for finding first-order critical points. However, variance-reduction techniques typically require carefully tuned learning rates and a willingness to use excessively large "mega-batches" in order to achieve their improved results.
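To make the idea concrete, the following is a minimal sketch of a momentum-based variance-reduced update in the style this paper studies (the STORM recursion): each step reuses the same stochastic sample at the current and previous iterate, so the correction term has low variance and no mega-batches or checkpoints are needed. The function names, hyperparameters, and the toy noisy-quadratic objective below are illustrative assumptions, not the paper's adaptive parameter choices.

```python
import numpy as np

def storm_sketch(grad, x0, steps=500, lr=0.1, a=0.1, rng=None):
    """Momentum-based variance-reduced SGD sketch (STORM-style recursion).

    grad(x, seed) must return a stochastic gradient at x whose randomness
    is determined by `seed`, so the same sample can be evaluated at two
    points. Hyperparameters here are fixed for illustration only.
    """
    rng = rng or np.random.default_rng(0)
    x = np.asarray(x0, dtype=float)
    # Initial estimate: a plain stochastic gradient.
    d = grad(x, int(rng.integers(1 << 30)))
    for _ in range(steps):
        x_prev = x.copy()
        x = x - lr * d
        seed = int(rng.integers(1 << 30))
        # Key recursion: evaluate the SAME sample at x and x_prev, so the
        # correction (g_new - g_old) is small whenever x moves little.
        g_new = grad(x, seed)
        g_old = grad(x_prev, seed)
        d = g_new + (1 - a) * (d - g_old)
    return x
```

On a noisy quadratic (gradient `x` plus Gaussian noise shared through the seed), this recursion drives the iterate close to the minimizer at the origin without any large-batch gradient evaluations.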