Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability

Phan, Thomy, Ritz, Fabian, Altmann, Philipp, Zorn, Maximilian, Nüßlein, Jonas, Kölle, Michael, Gabor, Thomas, Linnhoff-Popien, Claudia

Dec-27-2023–arXiv.org Artificial Intelligence

Stochastic partial observability poses a major challenge for decentralized coordination in multi-agent reinforcement learning but is largely neglected in state-of-the-art research due to a strong focus on state-based centralized training for decentralized execution (CTDE) and benchmarks that lack sufficient stochasticity like StarCraft Multi-Agent Challenge (SMAC). In this paper, we propose Attention-based Embeddings of Recurrence In multi-Agent Learning (AERIAL) to approximate value functions under stochastic partial observability. AERIAL replaces the true state with a learned representation of multi-agent recurrence, considering more accurate information about decentralized agent decisions than state-based CTDE. We then introduce MessySMAC, a modified version of SMAC with stochastic observations and higher variance in initial states, to provide a more general and configurable benchmark regarding stochastic partial observability. We evaluate AERIAL in Dec-Tiger as well as in a variety of SMAC and MessySMAC maps, and compare the results with state-based CTDE. Furthermore, we evaluate the robustness of AERIAL and state-based CTDE against various stochasticity configurations in MessySMAC.

aerial, attention-based recurrence, multi-agent reinforcement learning, (8 more...)

arXiv.org Artificial Intelligence

Dec-27-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia > Arlington County
    - Arlington (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - Los Angeles County > Long Beach (0.04)
- Europe > Germany
  - Bavaria > Upper Bavaria > Munich (0.04)

Genre:
- Research Report > New Finding (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.88)