Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm

Jan-18-2025, 11:46:18 GMT–Neural Information Processing Systems

We improve a recent gurantee of Bach and Moulines on the linear convergence of SGD for smooth and strongly convex objectives, reducing a quadratic dependence on the strong convexity to a linear dependence. Furthermore, we show how reweighting the sampling distribution (i.e. Our results are based on a connection we make between SGD and the randomized Kaczmarz algorithm, which allows us to transfer ideas between the separate bodies of literature studying each of the two methods.

artificial intelligence, machine learning, randomized kaczmarz algorithm, (5 more...)

Neural Information Processing Systems

Jan-18-2025, 11:46:18 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.85)