tammelin
Tammelin
Cepheus is the first computer program to essentially solve a game of imperfect information that is played competitively by humans. The game it plays is heads-up limit Texas hold'em poker, a game with over 10^14 information sets, and a challenge problem for artificial intelligence for over 10 years. Cepheus was trained using a new variant of Counterfactual Regret Minimization (CFR), called CFR+, using 4800 CPUs running for 68 days. In this paper we describe in detail the engineering required to make this computation a reality. We also prove the theoretical soundness of CFR+ and its component algorithm, regret-matching+. We further give a hint towards understanding the success of CFR+ by proving a tracking regret bound for this new regret-matching algorithm.
Revisiting CFR+ and Alternating Updates
Burch, Neil, Moravcik, Matej, Schmid, Martin
The CFR+ algorithm for solving imperfect information games is a variant of the popular CFR algorithm, with faster empirical performance on a range of problems. It was introduced with a theoretical upper bound on solution error, but subsequent work showed an error in one step of the proof. We provide updated proofs to recover the original bound.