Learning to Play No-Press Diplomacy with Best Response Policy Iteration

May-27-2025, 12:27:33 GMT–Neural Information Processing Systems

Recent advances in deep reinforcement learning (RL) have led to considerable progress in many 2-player zero-sum games, such as Go, Poker and Starcraft. The purely adversarial nature of such games allows for a conceptually simple and principled application of RL methods. However, real-world settings are many-agent, and agent interactions are complex mixtures of common-interest and competitive aspects. We consider Diplomacy, a 7-player board game designed to accentuate dilemmas resulting from many-agent interactions. It also features a large combinatorial action space and simultaneous moves, both of which are challenging for RL algorithms.

diplomacy, machine learning, reinforcement learning

Industry:
- Leisure & Entertainment > Games (0.63)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)