$\ell_p$-Regression in the Arbitrary Partition Model of Communication
Li, Yi, Lin, Honghao, Woodruff, David P.
–arXiv.org Artificial Intelligence
We consider the randomized communication complexity of the distributed $\ell_p$-regression problem in the coordinator model, for $p\in (0,2]$. In this problem, there is a coordinator and $s$ servers. The $i$-th server receives $A^i\in\{-M, -M+1, \ldots, M\}^{n\times d}$ and $b^i\in\{-M, -M+1, \ldots, M\}^n$ and the coordinator would like to find a $(1+\epsilon)$-approximate solution to $\min_{x\in\mathbb{R}^n} \|(\sum_i A^i)x - (\sum_i b^i)\|_p$. Here $M \leq \mathrm{poly}(nd)$ for convenience. This model, where the data is additively shared across servers, is commonly referred to as the arbitrary partition model. We obtain significantly improved bounds for this problem. For $p = 2$, i.e., least squares regression, we give the first optimal bound of $\tilde{\Theta}(sd^2 + sd/\epsilon)$ bits. For $p \in (1,2)$,we obtain an $\tilde{O}(sd^2/\epsilon + sd/\mathrm{poly}(\epsilon))$ upper bound. Notably, for $d$ sufficiently large, our leading order term only depends linearly on $1/\epsilon$ rather than quadratically. We also show communication lower bounds of $\Omega(sd^2 + sd/\epsilon^2)$ for $p\in (0,1]$ and $\Omega(sd^2 + sd/\epsilon)$ for $p\in (1,2]$. Our bounds considerably improve previous bounds due to (Woodruff et al. COLT, 2013) and (Vempala et al., SODA, 2020).
arXiv.org Artificial Intelligence
Jul-11-2023
- Country:
- Asia > Singapore (0.04)
- Europe
- Germany > Baden-Württemberg
- Freiburg (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Germany > Baden-Württemberg
- North America > United States
- California
- Alameda County > Berkeley (0.04)
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Santa Clara County > San Jose (0.04)
- Colorado
- Boulder County > Boulder (0.04)
- Denver County > Denver (0.04)
- Maryland > Montgomery County
- Bethesda (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California
- Genre:
- Research Report (0.50)
- Industry:
- Government > Regional Government (0.47)
- Technology: