self-play monte-carlo tree search
Self-Play Monte-Carlo Tree Search in Computer Poker
Heinrich, Johannes (University College London) | Silver, David (University College London)
Self-play reinforcement learning has proved to be successful in many perfect information two-player games. However, research carrying over its theoretical guarantees and practical success to games of imperfect information has been lacking. In this paper, we evaluate self-play Monte-Carlo Tree Search in limit Texas Hold'em and Kuhn poker. We introduce a variant of the established UCB algorithm and provide first empirical results demonstrating its ability to find approximate Nash equilibria.