MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning

Samvelyan, Mikayel, Khan, Akbir, Dennis, Michael, Jiang, Minqi, Parker-Holder, Jack, Foerster, Jakob, Raileanu, Roberta, Rocktäschel, Tim

Mar-6-2023–arXiv.org Artificial Intelligence

Open-ended learning methods that automatically generate a curriculum of increasingly challenging tasks serve as a promising avenue toward generally capable reinforcement learning agents. Existing methods adapt curricula independently over either environment parameters (in single-agent settings) or co-player policies (in multi-agent settings). However, the strengths and weaknesses of co-players can manifest themselves differently depending on environmental features. It is thus crucial to consider the dependency between the environment and co-player when shaping a curriculum in multi-agent domains. In this work, we use this insight and extend Unsupervised Environment Design (UED) to multi-agent environments. We then introduce Multi-Agent Environment Design Strategist for Open-Ended Learning (MAESTRO), the first multi-agent UED approach for two-player zero-sum settings. MAESTRO efficiently produces adversarial, joint curricula over both environments and co-players and attains minimax-regret guarantees at Nash equilibrium. Our experiments show that MAESTRO outperforms a number of strong baselines on competitive two-player games, spanning discrete and continuous control settings.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

Mar-6-2023

arXiv.org PDF

Add feedback

Country:
- Asia
  - China (0.04)
  - Japan > Honshū
    - Kansai > Osaka Prefecture > Osaka (0.04)
  - Malaysia (0.04)
  - Middle East
    - Bahrain (0.04)
    - Republic of Türkiye > Karaman Province
      - Karaman (0.04)
  - Russia (0.04)
  - Singapore (0.04)
- Europe
  - Hungary (0.04)
  - Portugal (0.04)
  - Belgium (0.04)
  - Russia (0.04)
  - Italy (0.04)
  - France (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Monaco (0.04)
  - Netherlands (0.04)
  - Spain (0.04)
  - Germany (0.04)
  - Austria (0.04)
- North America
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
  - Mexico (0.04)
  - United States > Massachusetts
    - Middlesex County > Cambridge (0.04)
- Oceania > Australia (0.04)
- South America > Brazil (0.04)

Genre:
- Research Report (0.82)

Industry:
- Education (1.00)
- Leisure & Entertainment
  - Games (1.00)
  - Sports > Motorsports
    - Formula One (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Agents (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found