Deep Counterfactual Regret Minimization
Brown, Noam, Lerer, Adam, Gross, Sam, Sandholm, Tuomas
–arXiv.org Artificial Intelligence
Counterfactual Regret Minimization (CFR) is the leading algorithm for solving large imperfect-information games. It iteratively traverses the game tree in order to converge to a Nash equilibrium. In order to deal with extremely large games, CFR typically uses domain-specific heuristics to simplify the target game in a process known as abstraction. This simplified game is solved with tabular CFR, and its solution is mapped back to the full game. This paper introduces Deep Counterfactual Regret Minimization (Deep CFR), a form of CFR that obviates the need for abstraction by instead using deep neural networks to approximate the behavior of CFR in the full game. We show that Deep CFR is principled and achieves strong performance in large poker games. This is the first non-tabular variant of CFR to be successful in large games.
arXiv.org Artificial Intelligence
Nov-13-2018
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Alberta (0.14)
- United States
- New York > New York County
- New York City (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.14)
- Texas (0.05)
- New York > New York County
- Europe > United Kingdom
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: