Efficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions

Burch, Neil, Lanctot, Marc, Szafron, Duane, Gibson, Richard G.

Dec-31-2012–Neural Information Processing Systems

Counterfactual Regret Minimization (CFR) is a popular, iterative algorithm for computing strategies in extensive-form games. The Monte Carlo CFR (MCCFR) variants reduce the per iteration time cost of CFR by traversing a smaller, sampled portion of the tree. The previous most effective instances of MCCFR can still be very slow in games with many player actions since they sample every action for a given player. In this paper, we present a new MCCFR algorithm, Average Strategy Sampling(AS), that samples a subset of the player's actions according to the player's average strategy. Our new algorithm is inspired by a new, tighter bound on the number of iterations required by CFR to converge to a given solution quality. In addition, we prove a similar, tighter bound for AS and other popular MCCFR variants.

artificial intelligence, game theory, information, (16 more...)

Neural Information Processing Systems

Dec-31-2012

Conferences PDF

Add feedback

Country:
- North America > Canada > Alberta (0.29)

Industry:
- Leisure & Entertainment > Games > Poker (0.46)

Technology:
- Information Technology
  - Artificial Intelligence
    - Games > Poker (0.47)
    - Representation & Reasoning (0.68)
  - Game Theory (0.95)

Duplicate Docs Excel Report

Title
Efficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions

Similar Docs Excel Report more

Title	Similarity	Source
None found