Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm

Neural Information Processing Systems 

We improve a recent guarantee of Bach and Moulines on the linear convergence of SGD for smooth and strongly convex objectives, reducing a quadratic dependence on the strong convexity to a linear dependence. Furthermore, we show how reweighting the sampling distribution (i.e. importance sampling) can be used to further improve the convergence rate, and we draw a connection between SGD and the randomized Kaczmarz algorithm that allows results to transfer between the two bodies of literature.
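To make the SGD-Kaczmarz connection concrete: for a consistent least-squares system Ax = b, running SGD on f(x) = (1/2)||Ax - b||^2 with row i sampled proportionally to ||a_i||^2 (weighted sampling) and step size 1/||a_i||^2 reproduces the randomized Kaczmarz projection step. The Python/NumPy sketch below illustrates this under those assumptions; it is not code from the paper, and the function name and parameter choices are our own.

    import numpy as np

    def randomized_kaczmarz(A, b, iters=1000, seed=0):
        """Solve a consistent system Ax = b by randomized Kaczmarz,
        i.e. SGD on (1/2)||Ax - b||^2 with rows drawn proportionally
        to their squared norms (importance sampling)."""
        rng = np.random.default_rng(seed)
        m, n = A.shape
        row_norms_sq = np.einsum('ij,ij->i', A, A)   # ||a_i||^2 per row
        probs = row_norms_sq / row_norms_sq.sum()    # weighted sampling distribution
        x = np.zeros(n)
        for _ in range(iters):
            i = rng.choice(m, p=probs)               # pick row i w.p. prop. to ||a_i||^2
            a_i = A[i]
            # Project x onto the hyperplane {z : <a_i, z> = b_i};
            # equivalently, an SGD step with step size 1/||a_i||^2.
            x += (b[i] - a_i @ x) / row_norms_sq[i] * a_i
        return x

As a quick (again illustrative) check on a consistent overdetermined system, where the iterates converge linearly in expectation:

    rng = np.random.default_rng(1)
    A = rng.standard_normal((200, 20))
    x_true = rng.standard_normal(20)
    b = A @ x_true
    x_hat = randomized_kaczmarz(A, b, iters=5000)
    print(np.linalg.norm(x_hat - x_true))   # should be close to zero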