Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm

Deanna Needell, Rachel Ward, Nati Srebro

Neural Information Processing Systems 

We improve a recent guarantee of Bach and Moulines on the linear convergence of SGD for smooth and strongly convex objectives, reducing a quadratic dependence on the strong convexity to a linear dependence. Furthermore, we show how reweighting the sampling distribution (i.e. importance sampling) is necessary in order to further reduce the variance of the stochastic gradient. Our results are based on a connection we make between SGD and the randomized Kaczmarz algorithm, which allows us to transfer ideas between the separate bodies of literature studying each of the two methods.
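For concreteness, here is a minimal sketch (not taken from the paper) of the randomized Kaczmarz algorithm with rows sampled proportionally to their squared norms, the standard weighted sampling scheme of Strohmer and Vershynin that underlies the SGD connection described above. Function names and parameters are illustrative assumptions.

```python
import numpy as np

def randomized_kaczmarz(A, b, n_iters=5000, seed=0):
    """Randomized Kaczmarz for a consistent linear system Ax = b.

    Rows are sampled with probability proportional to their squared
    norms, i.e. the weighted (importance) sampling distribution; each
    update is equivalently an SGD step on the least-squares objective.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    row_norms_sq = np.einsum("ij,ij->i", A, A)  # ||a_i||^2 for each row
    probs = row_norms_sq / row_norms_sq.sum()   # importance weights
    x = np.zeros(n)
    for _ in range(n_iters):
        i = rng.choice(m, p=probs)              # weighted row selection
        a_i = A[i]
        # Project the iterate onto the hyperplane <a_i, x> = b_i.
        x += (b[i] - a_i @ x) / row_norms_sq[i] * a_i
    return x

# Usage: recover x* from a consistent overdetermined system.
rng = np.random.default_rng(1)
A = rng.standard_normal((200, 20))
x_star = rng.standard_normal(20)
b = A @ x_star
x_hat = randomized_kaczmarz(A, b)
print(np.linalg.norm(x_hat - x_star))  # error decays linearly in n_iters
```

For a consistent system, this iteration converges linearly at a rate governed by the condition number of A, which is the Kaczmarz-side analogue of the linear-dependence guarantee discussed in the abstract.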