Learning Reward Machines in Cooperative Multi-Agent Tasks

Ardon, Leo, Furelos-Blanco, Daniel, Russo, Alessandra

May-24-2023–arXiv.org Artificial Intelligence

This paper presents a novel approach to Multi-Agent Reinforcement Learning (MARL) that combines cooperative task decomposition with the learning of reward machines (RMs) encoding the structure of the sub-tasks. The proposed method helps deal with the non-Markovian nature of the rewards in partially observable environments and improves the interpretability of the learnt policies required to complete the cooperative task. The RMs associated with each sub-task are learnt in a decentralised manner and then used to guide the behaviour of each agent. By doing so, the complexity of a cooperative multi-agent problem is reduced, allowing for more effective learning. The results suggest that our approach is a promising direction for future research in MARL, especially in complex environments with large state spaces and multiple agents.

agent, artificial intelligence, rms, (15 more...)

arXiv.org Artificial Intelligence

May-24-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report
  - New Finding (0.34)
  - Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found