A Unified Convergence Analysis for Shuffling-Type Gradient Methods
Nguyen, Lam M., Tran-Dinh, Quoc, Phan, Dzung T., Nguyen, Phuong Ha, van Dijk, Marten
In this paper, we provide a unified convergence analysis for a class of shuffling-type gradient methods for solving a well-known finite-sum minimization problem commonly used in machine learning. This algorithm covers various variants such as randomized reshuffling, single shuffling, and cyclic/incremental gradient schemes. We consider two different settings: strongly convex and non-convex problems. Our main contribution consists of new non-asymptotic and asymptotic convergence rates for a general class of shuffling-type gradient methods to solve both non-convex and strongly convex problems. While our rate in the non-convex problem is new (i.e. not known yet under standard assumptions), the rate on the strongly convex case matches (up to a constant) the best-known results. However, unlike existing works in this direction, we only use standard assumptions such as smoothness and strong convexity. Finally, we empirically illustrate the effect of learning rates via a non-convex logistic regression and neural network examples.
Feb-19-2020
- Country:
- Asia > Russia (0.04)
- North America > United States
- North Carolina > Orange County
- Chapel Hill (0.04)
- Connecticut > Tolland County
- Storrs (0.14)
- North Carolina > Orange County
- Europe
- Russia (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Education (0.67)
- Technology: