R-MADDPG for Partially Observable Environments and Limited Communication

Wang, Rose E., Everett, Michael, How, Jonathan P.

Feb-17-2020–arXiv.org Artificial Intelligence

There are several real-world tasks that would benefit from applying multiagent reinforcement learning (MARL) algorithms, including the coordination among self-driving cars. The real world has challenging conditions for multiagent learning systems, such as its partial observable and nonstationary nature. Moreover, if agents must share a limited resource (e.g. network bandwidth) they must all learn how to coordinate resource use. This paper introduces a deep recurrent multiagent actor-critic framework (R-MADDPG) for handling multiagent coordination under partial observable set-tings and limited communication. We investigate recurrency effects on performance and communication use of a team of agents. We demonstrate that the resulting framework learns time dependencies for sharing missing observations, handling resource limitations, and developing different communication patterns among agents.

agent, communication budget, r-maddpg, (13 more...)

arXiv.org Artificial Intelligence

Feb-17-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.14)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.68)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found