Intention Propagation for Multi-agent Reinforcement Learning

Qu, Chao, Li, Hui, Liu, Chang, Xiong, Junwu, Zhang, James, Chu, Wei, Qi, Yuan, Song, Le

Apr-19-2020–arXiv.org Machine Learning

Collaborative multi-agent reinforcement learning is an important sub-field of the multiagent reinforcement learning (MARL), where the agents learn to coordinate to achieve joint success. It has wide applications in traffic control [Kuyer et al., 2008], autonomous driving [Shalev-Shwartz et al., 2016] and smart grid [Yang et al., 2018]. To learn a coordination, the interactions between agents are indispensable. For instance, humans can reason about other's behaviors or know other peoples' intentions through communication and then determine an effective coordination plan. However, how to design a mechanism of such interaction in a principled way and at the same time solve the large scale real-world applications is still a challenging problem. Recently, there is a surge of interest in solving the collaborative MARL problem [Foerster et al., 2018, Qu et al., 2019, Lowe et al., 2017]. Among them, joint policy approaches have demonstrated their superiority [Rashid et al., 2018, Sunehag et al., 2018, Oliehoek et al., 2016]. A straightforward approach is to replace the action in the single-agent reinforcement learning by the joint action a (a 1, a 2,..., a N), while it obviously suffers from the issue of the exponentially large action space.

agent, algorithm, average reward, (15 more...)

arXiv.org Machine Learning

Apr-19-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.04)
- Asia > China
  - Zhejiang Province > Hangzhou (0.04)
  - Shanghai > Shanghai (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Leisure & Entertainment (0.67)
- Transportation > Ground
  - Road (0.66)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.88)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found