Goto

Collaborating Authors

 structured attentive reasoning


Learning Multi-Agent Communication through Structured Attentive Reasoning

Neural Information Processing Systems

Learning communication via deep reinforcement learning has recently been shown to be an effective way to solve cooperative multi-agent tasks. However, learning which communicated information is beneficial for each agent's decision-making process remains a challenging task. In order to address this problem, we explore relational reinforcement learning which leverages attention-based networks to learn efficient and interpretable relations between entities. On the foundation of relations, we introduce a novel communication architecture that exploits a memory-based attention network that selectively reasons about the value of information received from other agents while considering its past experiences. Specifically, the model communicates by first computing the relevance of messages received from other agents and then extracts task-relevant information from memories given the newly received information. We empirically demonstrate the strength of our model in cooperative and competitive multi-agent tasks, where inter-agent communication and reasoning over prior information substantially improves performance compared to baselines. We further show in the accompanying videos and experimental results that the agents learn a sophisticated and diverse set of cooperative behaviors to solve challenging tasks, both for discrete and continuous action spaces using on-policy and off-policy gradient methods. By developing an explicit architecture that is targeted towards communication, our work aims to open new directions to overcome important challenges in multi-agent cooperation through learned communication.


Review for NeurIPS paper: Learning Multi-Agent Communication through Structured Attentive Reasoning

Neural Information Processing Systems

I suggest finding a better example to motivate this approach. Few minor corrections: 1) Some of the citations are referred as arxiv submissions but are journal publications, so please update references (e.g [3], [10], [18], [24] -- which is also missing author information.


Review for NeurIPS paper: Learning Multi-Agent Communication through Structured Attentive Reasoning

Neural Information Processing Systems

All 4 reviewers suggest that this paper is above the acceptance threshold. After reading the reviews and author response, I will recommend acceptance with a note (see bellow). Reviewers agree that the idea of filtering what information from other agents to consider within a multi-agent setup, while not new, is an important area of research, especially as the number of agents and the complexity of the environments grows. All reviewers agree that this is a sound submission, with good experimental results and comprehensive comparisons with previous work. Some concerns were raised, but the author response did a good job in addressing many of those.


Learning Multi-Agent Communication through Structured Attentive Reasoning

Neural Information Processing Systems

Learning communication via deep reinforcement learning has recently been shown to be an effective way to solve cooperative multi-agent tasks. However, learning which communicated information is beneficial for each agent's decision-making process remains a challenging task. In order to address this problem, we explore relational reinforcement learning which leverages attention-based networks to learn efficient and interpretable relations between entities. On the foundation of relations, we introduce a novel communication architecture that exploits a memory-based attention network that selectively reasons about the value of information received from other agents while considering its past experiences. Specifically, the model communicates by first computing the relevance of messages received from other agents and then extracts task-relevant information from memories given the newly received information.