Intention Propagation for Multi-agent Reinforcement Learning
Qu, Chao, Li, Hui, Liu, Chang, Xiong, Junwu, Zhang, James, Chu, Wei, Qi, Yuan, Song, Le
Collaborative multi-agent reinforcement learning is an important sub-field of the multiagent reinforcement learning (MARL), where the agents learn to coordinate to achieve joint success. It has wide applications in traffic control [Kuyer et al., 2008], autonomous driving [Shalev-Shwartz et al., 2016] and smart grid [Yang et al., 2018]. To learn a coordination, the interactions between agents are indispensable. For instance, humans can reason about other's behaviors or know other peoples' intentions through communication and then determine an effective coordination plan. However, how to design a mechanism of such interaction in a principled way and at the same time solve the large scale real-world applications is still a challenging problem. Recently, there is a surge of interest in solving the collaborative MARL problem [Foerster et al., 2018, Qu et al., 2019, Lowe et al., 2017]. Among them, joint policy approaches have demonstrated their superiority [Rashid et al., 2018, Sunehag et al., 2018, Oliehoek et al., 2016]. A straightforward approach is to replace the action in the single-agent reinforcement learning by the joint action a (a 1, a 2,..., a N), while it obviously suffers from the issue of the exponentially large action space.
Apr-19-2020
- Country:
- North America > United States (0.04)
- Asia > China
- Zhejiang Province > Hangzhou (0.04)
- Shanghai > Shanghai (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment (0.67)
- Transportation > Ground
- Road (0.66)
- Technology: