DR-DSGD: A Distributionally Robust Decentralized Learning Algorithm over Graphs

Issaid, Chaouki Ben, Elgabli, Anis, Bennis, Mehdi

Sep-12-2022–arXiv.org Artificial Intelligence

In this paper, we propose to solve a regularized distributionally robust learning problem in the decentralized setting, accounting for data distribution shift. By adding a Kullback-Leibler regularization function to the robust min-max optimization problem, the learning problem can be reduced to a modified robust minimization problem and solved efficiently. Leveraging the newly formulated optimization problem, we propose a robust version of Decentralized Stochastic Gradient Descent (DSGD), coined Distributionally Robust Decentralized Stochastic Gradient Descent (DR-DSGD). Under some mild assumptions and provided that the regularization parameter is larger than one, we theoretically prove that DR-DSGD achieves a convergence rate of $\mathcal{O}\left(1/\sqrt{KT} + K/T\right)$, where $K$ is the number of devices and $T$ is the number of iterations. Simulation results show that our proposed algorithm can improve the worst-distribution test accuracy by up to $10\%$. Moreover, DR-DSGD is more communication-efficient than DSGD, since it requires up to $20$ times fewer communication rounds to achieve the same worst-distribution test accuracy target. Furthermore, the conducted experiments reveal that DR-DSGD yields fairer performance across devices in terms of test accuracy.
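The reduction mentioned in the abstract can be illustrated with a small numerical sketch. It relies on a standard identity: maximizing a weighted sum of per-device losses over the probability simplex, minus $\mu$ times the KL divergence to the uniform distribution, equals $\mu \log\left(\frac{1}{K}\sum_i \exp(f_i/\mu)\right)$. The function names, the mixing matrix `W`, and the "local step, then neighbor averaging" order below are assumptions made for illustration; this is not claimed to be the authors' exact update rule.

```python
import numpy as np

def kl_robust_objective(losses, mu):
    """Closed form of max over simplex weights of
    sum_i lam_i * loss_i - mu * KL(lam || uniform),
    i.e. mu * log(mean(exp(losses / mu))), computed stably."""
    z = np.asarray(losses, dtype=float) / mu
    m = z.max()
    return mu * (m + np.log(np.mean(np.exp(z - m))))

def worst_case_weights(losses, mu):
    """Maximizing weights: lam_i proportional to exp(loss_i / mu)."""
    z = np.asarray(losses, dtype=float) / mu
    w = np.exp(z - z.max())
    return w / w.sum()

def dr_dsgd_step(thetas, grads, losses, W, eta, mu):
    """One illustrative decentralized round (assumed form).

    thetas, grads: arrays of shape (K, d), one row per device.
    losses: length-K array of local loss values.
    W: (K, K) doubly stochastic mixing matrix of the graph (hypothetical).
    Each device scales its gradient by exp(loss / mu) / mu, which is the
    gradient of exp(loss / mu), takes a local step, then averages with
    its neighbors through W.
    """
    scale = np.exp(np.asarray(losses, dtype=float) / mu) / mu
    local = thetas - eta * scale[:, None] * grads
    return W @ local

# Example with K = 3 devices and mu > 1, as the abstract requires.
losses = np.array([1.0, 2.0, 3.0])
mu = 2.0
robust_value = kl_robust_objective(losses, mu)
lam = worst_case_weights(losses, mu)
```

The robust value always lies between the average loss and the worst loss. Smaller values of `mu` push the weights `lam` toward the device with the largest loss, while larger values recover plain averaging.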



Country:
- Asia (0.04)
- North America > United States
  - Virginia (0.04)
  - California > Los Angeles County
    - Pasadena (0.04)
- Europe > Finland
  - Northern Ostrobothnia > Oulu (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Energy (0.46)
- Education > Focused Education
  - Special Education (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Statistical Learning
    - Gradient Descent (1.00)
