AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning

Iqbal, Shariq, de Witt, Christian A. Schroeder, Peng, Bei, Böhmer, Wendelin, Whiteson, Shimon, Sha, Fei

Jun-7-2020–arXiv.org Artificial Intelligence

Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities. Agents frequently do not know a priori how many other agents and non-agent entities they will need to interact with in order to complete a given task, requiring agents to generalize across a combinatorial number of task configurations with each potentially requiring different strategies. In this work, we tackle the problem of multi-agent reinforcement learning (MARL) in such dynamic scenarios. We hypothesize that, while the optimal behaviors in these scenarios with varying quantities and types of agents/entities are diverse, they may share common patterns within sub-teams of agents that are combined to form team behavior. As such, we propose a method that can learn these subgroup relationships and how they can be combined, ultimately improving knowledge sharing and generalization across scenarios. This method, Attentive-Imaginative QMIX, extends QMIX for dynamic MARL in two ways: 1) an attention mechanism that enables model sharing across variable sized scenarios and 2) a training objective that improves learning across scenarios with varying combinations of agent/entity types by factoring the value function into imagined sub-scenarios. We validate our approach on both a novel grid-world task as well as a version of the StarCraft Multi-Agent Challenge [28] minimally modified for the dynamic scenario setting.

artificial intelligence, scenario, soccer, (20 more...)

arXiv.org Artificial Intelligence

Jun-7-2020

arXiv.org PDF

Add feedback

Country:
- Europe (0.28)
- North America > United States
  - California (0.28)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Leisure & Entertainment > Sports > Soccer (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)
    - Neural Networks > Deep Learning (0.46)
    - Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found