Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

Christianos, Filippos, Papoudakis, Georgios, Albrecht, Stefano V.

Oct-14-2023–arXiv.org Artificial Intelligence

This work focuses on equilibrium selection in no-conflict multi-agent games, where we specifically study the problem of selecting a Pareto-optimal Nash equilibrium among several existing equilibria. It has been shown that many state-of-the-art multi-agent reinforcement learning (MARL) algorithms are prone to converging to Pareto-dominated equilibria due to the uncertainty each agent has about the policy of the other agents during training. To address sub-optimal equilibrium selection, we propose Pareto Actor-Critic (Pareto-AC), which is an actor-critic algorithm that utilises a simple property of no-conflict games (a superset of cooperative games): the Pareto-optimal equilibrium in a no-conflict game maximises the returns of all agents and, therefore, is the preferred outcome for all agents. We evaluate Pareto-AC in a diverse set of multi-agent games and show that it converges to higher episodic returns compared to seven state-of-the-art MARL algorithms and that it successfully converges to a Pareto-optimal equilibrium in a range of matrix games. Finally, we propose PACDCG, a graph neural network extension of Pareto-AC, which is shown to efficiently scale in games with a large number of agents.

agent, algorithm, q-value, (14 more...)

arXiv.org Artificial Intelligence

Oct-14-2023

arXiv.org PDF

Add feedback

Country:
- Asia > Japan (0.04)
- North America
  - United States
    - Virginia > Arlington County
      - Arlington (0.04)
    - California > San Francisco County
      - San Francisco (0.14)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Netherlands (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents
    - Agent Societies (0.49)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found