Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Yu, Xiaoxue, Li, Rongpeng, Liang, Chengchao, Zhao, Zhifeng
–arXiv.org Artificial Intelligence
Multi-Agent Reinforcement Learning (MARL) has emerged as a foundational approach for addressing diverse, intelligent control tasks, notably in autonomous driving within the Internet of Vehicles (IoV) domain. However, the widely assumed existence of a central node for centralized, federated learning-assisted MARL might be impractical in highly dynamic environments. This can lead to excessive communication overhead, potentially overwhelming the IoV system. To address these challenges, we design a novel communication-efficient and policy collaboration algorithm for MARL under the frameworks of Soft Actor-Critic (SAC) and Decentralized Federated Learning (DFL), named RSM-MASAC, within a fully distributed architecture. In particular, RSM-MASAC enhances multi-agent collaboration and prioritizes higher communication efficiency in dynamic IoV system by incorporating the concept of segmented aggregation in DFL and augmenting multiple model replicas from received neighboring policy segments, which are subsequently employed as reconstructed referential policies for mixing. Distinctively diverging from traditional RL approaches, with derived new bounds under Maximum Entropy Reinforcement Learning (MERL), RSM-MASAC adopts a theory-guided mixture metric to regulate the selection of contributive referential policies to guarantee the soft policy improvement during communication phase. Finally, the extensive simulations in mixed-autonomy traffic control scenarios verify the effectiveness and superiority of our algorithm.
arXiv.org Artificial Intelligence
Dec-15-2023
- Country:
- Asia
- China > Chongqing Province
- Chongqing (0.04)
- Malaysia > Kuala Lumpur
- Kuala Lumpur (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- China > Chongqing Province
- Europe
- Denmark > Capital Region
- Kongens Lyngby (0.04)
- France > Hauts-de-France
- Italy > Tuscany
- Florence (0.04)
- Slovenia (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Denmark > Capital Region
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Colorado > Denver County
- Denver (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.04)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- California > Los Angeles County
- Canada > Quebec
- Oceania > Australia
- New South Wales > Sydney (0.14)
- South America > Brazil
- Rio de Janeiro > Rio de Janeiro (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Transportation > Ground > Road (0.66)
- Technology: