Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning

Gerstgrasser, Matthias, Parkes, David C.

Jun-1-2023–arXiv.org Artificial Intelligence

Stackelberg equilibria arise naturally in a range of popular learning problems, such as in security games or indirect mechanism design, and have received increasing attention in the reinforcement learning literature. We present a general framework for implementing Stackelberg equilibria search as a multi-agent RL problem, allowing a wide range of algorithmic design choices. We discuss how previous approaches can be seen as specific instantiations of this framework. As a key insight, we note that the design space allows for approaches not previously seen in the literature, for instance by leveraging multitask and meta-RL techniques for follower convergence. We propose one such approach using contextual policies, and evaluate it experimentally on both standard and novel benchmark domains, showing greatly improved sample efficiency compared to previous approaches. Finally, we explore the effect of adopting algorithm designs outside the borders of our framework.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Jun-1-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - California > Santa Clara County
    - Stanford (0.04)
- Europe > United Kingdom
  - England > Greater London > London (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.82)
- Workflow (0.68)

Industry:
- Leisure & Entertainment > Games
  - Computer Games (0.34)
- Education > Curriculum
  - Subject-Specific Education (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found