Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
Zhou, Tailin; Lin, Zehong; Zhang, Jun; Tsang, Danny H. K.
arXiv.org Artificial Intelligence
Model averaging is a widely adopted technique in federated learning (FL) that aggregates multiple client models to obtain a global model. Remarkably, model averaging in FL can yield a superior global model, even when client models are trained with non-convex objective functions and on heterogeneous local datasets. However, the rationale behind its success remains poorly understood. To shed light on this issue, we first visualize the loss landscape of FL over client and global models to illustrate their geometric properties. The visualization shows that the client models encompass the global model within a common basin, and interestingly, the global model may deviate from the bottom of the basin while still outperforming the client models. To gain further insights into model averaging in FL, we decompose the expected loss of the global model into five factors related to the client models. Specifically, our analysis reveals that the loss of the global model after early training mainly arises from (i) the client models' loss on non-overlapping data between client datasets and the global dataset and (ii) the maximum distance between the global and client models. Based on these findings from our loss landscape visualization and loss decomposition, we propose applying iterative moving averaging (IMA) to the global model in the late training phase to reduce its deviation from the expected minimum, while constraining client exploration to limit the maximum distance between the global and client models. Our experiments demonstrate that incorporating IMA into existing FL methods significantly improves their accuracy and training speed on various heterogeneous data setups of benchmark datasets.
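To make the two averaging steps in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes models are flat lists of floats, that client models are averaged with weights proportional to local dataset size, and that the moving average is taken over a fixed window of recent global models. The names `average_models`, `MovingAverage`, and `window` are hypothetical; the abstract does not specify the window length, the round at which averaging starts, or how client exploration is constrained, so none of those are modelled here.

```python
from collections import deque


def average_models(client_weights, client_sizes):
    """Weighted average of client parameter vectors (assumed size-weighted)."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


class MovingAverage:
    """Averages the most recent `window` global models (hypothetical helper)."""

    def __init__(self, window):
        # deque with maxlen silently drops the oldest model once full
        self.history = deque(maxlen=window)

    def update(self, global_model):
        self.history.append(list(global_model))
        count = len(self.history)
        dim = len(global_model)
        return [sum(m[i] for m in self.history) / count for i in range(dim)]


# Toy example: two clients, two rounds, two parameters per model.
round_1 = average_models([[1.0, 2.0], [3.0, 4.0]], client_sizes=[1, 3])
round_2 = average_models([[2.0, 2.0], [4.0, 6.0]], client_sizes=[1, 1])

ima = MovingAverage(window=2)
ima.update(round_1)
smoothed = ima.update(round_2)  # mean of the two most recent global models
```

In this toy run, `round_1` is the size-weighted mean of the two client vectors and `smoothed` is the element-wise mean of `round_1` and `round_2`. A real FL system would operate on per-layer tensors rather than flat lists.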
Oct-16-2023
- Country:
- North America
- United States
- Texas > Travis County
- Austin (0.28)
- Pennsylvania
- Philadelphia County > Philadelphia (0.14)
- Allegheny County > Pittsburgh (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Massachusetts > Middlesex County
- Chelmsford (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Monterey County
- Canada
- Quebec > Montreal (0.04)
- Ontario > Toronto (0.04)
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- Europe
- Asia
- Middle East > Israel
- Tel Aviv District > Tel Aviv (0.04)
- Malaysia > Kuala Lumpur
- Kuala Lumpur (0.04)
- China
- Hong Kong (0.05)
- Guangdong Province > Guangzhou (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Education (0.68)
- Technology: