Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles

Yu, Xiaoxue, Li, Rongpeng, Wang, Fei, Peng, Chenghui, Liang, Chengchao, Zhao, Zhifeng, Zhang, Honggang

Aug-8-2023–arXiv.org Artificial Intelligence

Multi-Agent Reinforcement Learning (MARL) has become a classic paradigm to solve diverse, intelligent control tasks like autonomous driving in Internet of Vehicles (IoV). However, the widely assumed existence of a central node to implement centralized federated learning-assisted MARL might be impractical in highly dynamic scenarios, and the excessive communication overheads possibly overwhelm the IoV system. Therefore, in this paper, we design a communication efficient cooperative MARL algorithm, named RSM-MAPPO, to reduce the communication overheads in a fully distributed architecture. In particular, RSM-MAPPO enhances the multi-agent Proximal Policy Optimization (PPO) by incorporating the idea of segment mixture and augmenting multiple model replicas from received neighboring policy segments. Afterwards, RSM-MAPPO adopts a theory-guided metric to regulate the selection of contributive replicas to guarantee the policy improvement. Finally, extensive simulations in a mixed-autonomy traffic control scenario verify the effectiveness of the RSM-MAPPO algorithm.

agent, artificial intelligence, machine learning, (14 more...)

arXiv.org Artificial Intelligence

Aug-8-2023

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - Rio de Janeiro > Rio de Janeiro (0.04)
- North America
  - United States > Massachusetts (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Switzerland > Zürich
    - Zürich (0.14)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Denmark > Capital Region
    - Kongens Lyngby (0.04)
- Asia > China
  - Chongqing Province > Chongqing (0.04)

Genre:
- Research Report (0.50)

Industry:
- Education (0.68)
- Information Technology > Robotics & Automation (0.34)
- Transportation > Ground
  - Road (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.50)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found