Robust and Communication-Efficient Collaborative Learning

Reisizadeh, Amirhossein, Taheri, Hossein, Mokhtari, Aryan, Hassani, Hamed, Pedarsani, Ramtin

Mar-19-2020, 00:02:18 GMT, Neural Information Processing Systems

We consider a decentralized learning problem in which a set of computing nodes collaboratively solve a non-convex optimization problem. Decentralized optimization schemes are well known to face two major system bottlenecks: straggler delay and communication overhead. In this paper, we tackle both bottlenecks by proposing a novel decentralized, gradient-based optimization algorithm named QuanTimed-DSGD. The algorithm rests on two main ideas: (i) at each iteration, every node's local gradient computation is subject to a deadline, and (ii) the nodes exchange quantized versions of their local models. Our key technical contribution is to prove that, even with non-vanishing quantization and stochastic-gradient noise, the proposed method converges exactly to the global optimum for convex loss functions and finds a first-order stationary point in non-convex scenarios.
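The two ideas in the abstract, deadline-limited gradient computation and exchange of quantized models, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The stochastic-rounding quantizer, the `speed` and `deadline` parameters, the mixing weights, and the step sizes `alpha` and `eps` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import math
import random


def quantize(x, step=0.1, rng=random):
    """Stochastically round each coordinate to a grid of width `step`.

    Rounding up with probability equal to the fractional part makes the
    quantizer unbiased in expectation (an assumed, common choice).
    """
    out = []
    for v in x:
        low = math.floor(v / step)
        p_up = v / step - low  # probability of rounding up
        out.append(step * (low + (1 if rng.random() < p_up else 0)))
    return out


def deadline_gradient(grad_fn, model, samples, speed, deadline):
    """Average the per-sample gradients a node finishes before the deadline.

    `speed` (samples per unit time) is a hypothetical stand-in for a
    node's random processing rate; slow nodes simply use fewer samples.
    """
    n_done = min(len(samples), int(speed * deadline))
    if n_done == 0:
        return [0.0] * len(model)  # straggler contributes no gradient
    total = [0.0] * len(model)
    for s in samples[:n_done]:
        g = grad_fn(model, s)
        total = [t + gi for t, gi in zip(total, g)]
    return [t / n_done for t in total]


def node_step(x_i, neighbor_msgs, weights, w_self, grad, alpha=0.1, eps=0.5):
    """One assumed local update: mix own model with quantized neighbor
    models, then take a damped gradient step."""
    mixed = [(1 - eps + eps * w_self) * v for v in x_i]
    for w, z in zip(weights, neighbor_msgs):
        mixed = [m + eps * w * zj for m, zj in zip(mixed, z)]
    return [m - alpha * eps * g for m, g in zip(mixed, grad)]
```

In this sketch each node would call `deadline_gradient` on its local data, send `quantize(x_i)` to its neighbors, and then apply `node_step` using the quantized models it received. A fast node averages over more samples than a slow one, but no node waits past the deadline.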

bottleneck, quantimed-dsgd, robust and communication-efficient collaborative learning

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.87)
  - Representation & Reasoning > Optimization (0.82)