Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability
Crawshaw, Michael, Woodworth, Blake, Liu, Mingrui
–arXiv.org Artificial Intelligence
Existing analysis of Local (Stochastic) Gradient Descent for heterogeneous objectives requires stepsizes $η\leq 1/K$ where $K$ is the communication interval, which ensures monotonic decrease of the objective. In contrast, we analyze Local Gradient Descent for logistic regression with separable, heterogeneous data using any stepsize $η> 0$. With $R$ communication rounds and $M$ clients, we show convergence at a rate $\mathcal{O}(1/ηK R)$ after an initial unstable phase lasting for $\widetilde{\mathcal{O}}(ηK M)$ rounds. This improves upon the existing $\mathcal{O}(1/R)$ rate for general smooth, convex objectives. Our analysis parallels the single machine analysis of~\cite{wu2024large} in which instability is caused by extremely large stepsizes, but in our setting another source of instability is large local updates with heterogeneous objectives.
arXiv.org Artificial Intelligence
Jun-18-2025
- Country:
- North America
- Canada (0.04)
- United States > District of Columbia
- Washington (0.04)
- Asia > Middle East
- Jordan (0.04)
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.63)
- Research Report
- Technology: