On Variance Reduction in Stochastic Gradient Descent and its Asynchronous Variants

Mar-13-2024, 03:44:52 GMT – Neural Information Processing Systems

We study optimization algorithms based on variance reduction for stochastic gradient descent (SGD). Remarkable recent progress has been made in this direction through the development of algorithms such as SAG, SVRG, and SAGA. These algorithms have been shown to outperform SGD, both theoretically and empirically. However, asynchronous versions of these algorithms, which are a crucial requirement for modern large-scale applications, have not been studied.
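
To make "variance reduction" concrete, the following is a minimal, serial sketch of an SVRG-style update on a toy least-squares problem. It is an illustration under stated assumptions, not the paper's implementation and not an asynchronous variant: the loss, the toy data, the step size, and the helper names (`grad_i`, `full_grad`, `svrg`) are all hypothetical choices made for this example. Each step corrects a single-example gradient with the same example's gradient at a periodic snapshot plus the full gradient at that snapshot, which keeps the estimate unbiased while shrinking its variance near the optimum.

```python
import random


def grad_i(w, x, y):
    """Gradient of the per-example loss 0.5 * (x.w - y)^2 with respect to w."""
    residual = sum(wj * xj for wj, xj in zip(w, x)) - y
    return [residual * xj for xj in x]


def full_grad(w, data):
    """Average of the per-example gradients over the whole dataset."""
    total = [0.0] * len(w)
    for x, y in data:
        total = [t + g for t, g in zip(total, grad_i(w, x, y))]
    return [t / len(data) for t in total]


def svrg(data, dim, step=0.05, epochs=30, inner_steps=200, seed=0):
    """SVRG-style loop (illustrative sketch; all defaults are assumptions)."""
    rng = random.Random(seed)
    w = [0.0] * dim
    for _ in range(epochs):
        # Take a snapshot and compute the full gradient there once per epoch.
        snapshot = list(w)
        mu = full_grad(snapshot, data)
        for _ in range(inner_steps):
            x, y = data[rng.randrange(len(data))]
            g_w = grad_i(w, x, y)          # stochastic gradient at w
            g_s = grad_i(snapshot, x, y)   # same example, at the snapshot
            # Variance-reduced direction: g_w - g_s + mu
            w = [wj - step * (a - b + m)
                 for wj, a, b, m in zip(w, g_w, g_s, mu)]
    return w


# Hypothetical noiseless data generated from the weights [2.0, -1.0].
data = [([1.0, 0.0], 2.0), ([0.0, 1.0], -1.0),
        ([1.0, 1.0], 1.0), ([1.0, -1.0], 3.0)]
w_hat = svrg(data, dim=2)
```

Replacing the direction `g_w - g_s + mu` with `g_w` alone recovers plain SGD, which is a simple way to compare the two on the same data.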

Country:
- North America > United States
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Statistical Learning
    - Gradient Descent (1.00)