Learning Efficient Multi-agent Communication: An Information Bottleneck Approach
Wang, Rundong, He, Xu, Yu, Runsheng, Qiu, Wei, An, Bo, Rabinovich, Zinovi
–arXiv.org Artificial Intelligence
Many real-world multi-agent reinforcement learning applications require agents to communicate, assisted by a communication protocol. These applications face a common and critical issue of communication's limited bandwidth that constrains agents' ability to cooperate successfully. In this paper, rather than proposing a fixed communication protocol, we develop an Informative Multi-Agent Communication (IMAC) method to learn efficient communication protocols. Our contributions are threefold. First, we notice a fact that a limited bandwidth translates into a constraint on the communicated message entropy, thus paving the way of controlling the bandwidth. Second, we introduce a customized batch-norm layer, which controls the messages' entropy to simulate the limited bandwidth constraint. Third, we apply the information bottleneck method to discover the optimal communication protocol, which can satisfy a bandwidth constraint via training with the prior distribution in the method. To demonstrate the efficacy of our method, we conduct extensive experiments in various cooperative and competitive multi-agent tasks across two dimensions: the number of agents and different bandwidths. We show that IMAC converges fast, and leads to efficient communication among agents under the limited-bandwidth constraint as compared to many baseline methods.
arXiv.org Artificial Intelligence
Nov-16-2019
- Country:
- North America > United States
- Montana (0.04)
- Asia > Japan
- Honshū
- Tōhoku > Fukushima Prefecture
- Fukushima (0.04)
- Chūbu > Ishikawa Prefecture
- Kanazawa (0.04)
- Tōhoku > Fukushima Prefecture
- Honshū
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Energy > Power Industry > Utilities > Nuclear (0.67)
- Technology: