XDO: ADoubleOracleAlgorithmfor Extensive-FormGames
–Neural Information Processing Systems
Policy Space Response Oracles (PSRO) is a reinforcement learning (RL) algorithm for two-player zero-sum games that has been empirically shown to find approximate Nash equilibria in large games.
Neural Information Processing Systems
Feb-11-2026, 01:02:49 GMT