Improving Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates
Federated reinforcement learning (FedRL) enables agents to collaboratively train a global policy without sharing their individual data. However, high communication overhead remains a critical bottleneck, particularly for natural policy gradient (NPG) methods, which are second-order. To address this issue, we propose the FedNPG-ADMM framework, which leverages the alternating direction method of multipliers (ADMM) to approximate global NPG directions efficiently. We theoretically demonstrate that using ADMM-based gradient updates reduces communication complexity from \mathcal{O}(d^{2}) to \mathcal{O}(d) at each iteration, where d is the number of model parameters. Through evaluation of the proposed algorithms in MuJoCo environments, we demonstrate that FedNPG-ADMM maintains the reward performance of standard FedNPG, and that its convergence rate improves when the number of federated agents increases.
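To make the communication argument concrete, below is a minimal consensus-ADMM sketch in NumPy, not the paper's exact algorithm: the global NPG direction solves (mean_i F_i) x = mean_i g_i, so aggregating the local d×d Fisher estimates F_i would cost \mathcal{O}(d^{2}) per round, whereas the ADMM iteration exchanges only d-dimensional primal/dual vectors, i.e. \mathcal{O}(d). The function name fednpg_admm_sketch and the parameters rho and num_rounds are illustrative assumptions, not identifiers from the paper.

```python
import numpy as np

def fednpg_admm_sketch(fishers, grads, rho=1.0, num_rounds=100):
    """Approximate the global NPG direction with consensus ADMM.

    Each agent i keeps its local Fisher estimate F_i (d x d) and policy
    gradient g_i (d,) private; only d-dimensional vectors cross the
    network each round, so per-round communication is O(d).
    """
    n = len(fishers)
    d = grads[0].shape[0]
    x = [np.zeros(d) for _ in range(n)]  # local primal iterates
    u = [np.zeros(d) for _ in range(n)]  # scaled dual variables
    z = np.zeros(d)                      # server-side consensus variable

    for _ in range(num_rounds):
        for i in range(n):
            # Local x-update (closed form for the quadratic subproblem):
            # argmin_x 0.5 x^T F_i x - g_i^T x + (rho/2) ||x - z + u_i||^2
            x[i] = np.linalg.solve(fishers[i] + rho * np.eye(d),
                                   grads[i] + rho * (z - u[i]))
        # Uplink: each agent sends only the d-dimensional vector x_i + u_i.
        z = np.mean([x[i] + u[i] for i in range(n)], axis=0)
        for i in range(n):
            # Dual ascent on the consensus constraint x_i = z.
            u[i] += x[i] - z
    return z

# Toy check against the exact solution, which would need O(d^2) communication.
rng = np.random.default_rng(0)
d, n = 8, 4
fishers = [(lambda A: A @ A.T + np.eye(d))(rng.standard_normal((d, d)))
           for _ in range(n)]  # SPD stand-ins for local Fisher estimates
grads = [rng.standard_normal(d) for _ in range(n)]

approx = fednpg_admm_sketch(fishers, grads)
exact = np.linalg.solve(np.mean(fishers, axis=0), np.mean(grads, axis=0))
print(np.linalg.norm(approx - exact))  # shrinks as num_rounds grows
```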
Neural Information Processing Systems