A Unified Convergence Analysis for Shuffling-Type Gradient Methods

Nguyen, Lam M., Tran-Dinh, Quoc, Phan, Dzung T., Nguyen, Phuong Ha, van Dijk, Marten

Feb-19-2020–arXiv.org Machine Learning

In this paper, we provide a unified convergence analysis for a class of shuffling-type gradient methods for solving a well-known finite-sum minimization problem commonly used in machine learning. This algorithm covers various variants such as randomized reshuffling, single shuffling, and cyclic/incremental gradient schemes. We consider two different settings: strongly convex and non-convex problems. Our main contribution consists of new non-asymptotic and asymptotic convergence rates for a general class of shuffling-type gradient methods to solve both non-convex and strongly convex problems. While our rate in the non-convex problem is new (i.e. not known yet under standard assumptions), the rate on the strongly convex case matches (up to a constant) the best-known results. However, unlike existing works in this direction, we only use standard assumptions such as smoothness and strong convexity. Finally, we empirically illustrate the effect of learning rates via a non-convex logistic regression and neural network examples.

algorithm 1, assumption, convergence rate, (14 more...)

arXiv.org Machine Learning

Feb-19-2020

arXiv.org PDF

Add feedback

Country:
- Asia > Russia (0.04)
- North America > United States
  - North Carolina > Orange County
    - Chapel Hill (0.04)
  - Connecticut > Tolland County
    - Storrs (0.14)
- Europe
  - Russia (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Education (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.93)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks (0.88)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found