Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall
–Neural Information Processing Systems
We study the problem of learning a Nash equilibrium (NE) in an imperfect information game (IIG) through self-play.
Neural Information Processing Systems
Nov-14-2025, 07:21:09 GMT
- Country:
- Europe
- Germany > Saxony-Anhalt
- Magdeburg (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany > Saxony-Anhalt
- North America
- Canada > Alberta (0.14)
- United States (0.14)
- Europe
- Industry:
- Leisure & Entertainment > Games (1.00)