Dynamic Adaptation and Opponent Exploitation in Computer Poker
Li, Xun (The University of Texas at Austin) | Miikkulainen, Risto (The University of Texas at Austin)
As a classic example of imperfect information games, Heads-Up No-limit Texas Holdem (HUNL), has been studied extensively in recent years. While state-of-the-art approaches based on Nash equilibrium have been successful, they lack the ability to model and exploit opponents effectively. This paper presents an evolutionary approach to discover opponent models based Long Short Term Memory neural networks and on Pattern Recognition Trees. Experimental results showed that poker agents built in this method can adapt to opponents they have never seen in training and exploit weak strategies far more effectively than Slumbot 2017, one of the cutting-edge Nash-equilibrium-based poker agents. In addition, agents evolved through playing against relatively weak rule-based opponents tied statistically with Slumbot in heads-up matches. Thus, the proposed approach is a promising new direction for building high-performance adaptive agents in HUNL and other imperfect information games.
Apr-6-2018
- Country:
- North America > United States > Texas (0.24)
- Genre:
- Research Report (0.73)
- Industry:
- Leisure & Entertainment > Games > Poker (0.40)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Games > Poker (1.00)
- Information Technology > Artificial Intelligence