Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing

Jan-19-2025, 03:26:11 GMT–Neural Information Processing Systems

Learning in real-world multiagent tasks is challenging due to the usual partial observability of each agent. Previous efforts alleviate the partial observability by historical hidden states with Recurrent Neural Networks, however, they do not consider the multiagent characters that either the multiagent observation consists of a number of object entities or the action space shows clear entity interactions. To tackle these issues, we propose the Agent Transformer Memory (ATM) network with a transformer-based memory. First, ATM utilizes the transformer to enable the unified processing of the factored environmental entities and memory. Inspired by the human's working memory process where a limited capacity of information temporarily held in mind can effectively guide the decision-making, ATM updates its fixed-capacity memory with the working memory updating schema.

action parsing, multiagent reinforcement learning, transformer-based working memory, (1 more...)

Neural Information Processing Systems

Jan-19-2025, 03:26:11 GMT

Conferences Web Page

Add feedback

Industry:
- Health & Medicine (0.89)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.98)