Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression

Michael Crawshaw, Blake Woodworth, Mingrui Liu

Jan-23-2025–arXiv.org Artificial Intelligence

We analyze two variants of Local Gradient Descent (Local GD) applied to distributed logistic regression with heterogeneous, separable data and show convergence at the rate $O(1/KR)$ for $K$ local steps and a sufficiently large number of communication rounds $R$. In contrast, all existing convergence guarantees for Local GD, applied to any problem, are at least $\Omega(1/R)$, meaning they fail to show the benefit of local updates. The key to our improved guarantee is showing progress on the logistic regression objective when using a large stepsize $\eta \gg 1/K$, whereas prior analysis depends on $\eta \leq 1/K$.
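To make the roles of $K$, $R$, and $\eta$ concrete, the following is a minimal sketch of plain Local GD on a synthetic heterogeneous, separable logistic regression problem. It is an illustration under stated assumptions, not a reproduction of either variant analyzed in the paper: the abstract does not specify the variants, and the data generator `make_clients`, the helper names, and all parameter values here are hypothetical.

```python
import numpy as np

def logistic_loss(w, X, y):
    # Mean of log(1 + exp(-y * <w, x>)), computed stably via logaddexp.
    return np.mean(np.logaddexp(0.0, -y * (X @ w)))

def logistic_grad(w, X, y):
    # Gradient of the mean logistic loss: mean of -y * sigmoid(-margin) * x.
    margins = y * (X @ w)
    coeff = -y * np.exp(-np.logaddexp(0.0, margins))
    return (X * coeff[:, None]).mean(axis=0)

def local_gd(clients, K, R, eta):
    # Plain Local GD (assumed variant): each client runs K gradient steps
    # on its own data, then the server averages the local iterates.
    # This is repeated for R communication rounds.
    d = clients[0][0].shape[1]
    w = np.zeros(d)
    for _ in range(R):
        local_iterates = []
        for X, y in clients:
            w_m = w.copy()
            for _ in range(K):
                w_m -= eta * logistic_grad(w_m, X, y)
            local_iterates.append(w_m)
        w = np.mean(local_iterates, axis=0)
    return w

def global_loss(w, clients):
    # Average of the client objectives (clients have equal sample counts).
    return np.mean([logistic_loss(w, X, y) for X, y in clients])

def make_clients(M=4, n=20, d=5, seed=0):
    # Hypothetical generator: every client is labeled by one shared
    # direction w_star (so the pooled data is linearly separable), while a
    # client-specific feature shift makes the local datasets heterogeneous.
    rng = np.random.default_rng(seed)
    w_star = rng.normal(size=d)
    w_star /= np.linalg.norm(w_star)
    clients = []
    for _ in range(M):
        X = rng.normal(size=(n, d)) + rng.normal(size=d)
        y = np.sign(X @ w_star)
        y[y == 0] = 1.0
        clients.append((X, y))
    return clients
```

For example, `global_loss(local_gd(make_clients(), K=8, R=50, eta=1.0), make_clients())` evaluates the objective after 50 rounds with a stepsize far above $1/K$. Sweeping `K` at fixed `R` is one way to probe, on toy data, the kind of dependence on local steps that the $O(1/KR)$ rate describes.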

Country:
- North America > United States (0.27)

Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.81)