Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning
–Neural Information Processing Systems
Recent algorithms have achieved superhuman performance at a number of twoplayer zero-sum games such as poker and go. However, many real-world situations are multi-player games. Zero-sum two-team games, such as bridge and football, involve two teams where each member of the team shares the same reward with every other member of that team, and each team has the negative of the reward of the other team. A popular solution concept in this setting, called TMECor, assumes that teams can jointly correlate their strategies before play, but are not able to communicate during play. This setting is harder than two-player zerosum games because each player on a team has different information and must use their public actions to signal to other members of the team.
Neural Information Processing Systems
May-25-2025, 04:40:04 GMT
- Country:
- North America > United States (0.14)
- Genre:
- Research Report (0.93)
- Industry:
- Leisure & Entertainment
- Games > Computer Games (0.68)
- Sports > Soccer (0.67)
- Leisure & Entertainment
- Technology: