GPU-Accelerated Counterfactual Regret Minimization

Sep-6-2024–arXiv.org Artificial Intelligence

Counterfactual regret minimization is a family of algorithms of no-regret learning dynamics capable of solving large-scale imperfect information games. We propose implementing this algorithm as a series of dense and sparse matrix and vector operations, thereby making it highly parallelizable for a graphical processing unit, at a cost of higher memory usages. Our experiments show that our implementation performs up to about 352.5 times faster than OpenSpiel's Python implementation and up to about 22.2 times faster than OpenSpiel's C++ implementation and the speedup becomes more pronounced as the size of the game being solved grows. Counterfactual regret minimization (CFR) (Zinkevich et al., 2007) is a family of algorithms of noregret learning dynamics capable of solving large-scale imperfect information games. Its variants dominated the development of AI agents for large imperfect information games like Poker (Tammelin et al., 2015; Moravčík et al., 2017; Brown & Sandholm, 2018; 2019b) and The Resistance: Avalon (Serrino et al., 2019) and were components of ReBeL (Brown et al., 2020) and student of games (Schmid et al., 2023).

implementation, openspiel, pvq, (16 more...)

arXiv.org Artificial Intelligence

Sep-6-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Texas (0.04)
  - Canada > Ontario
    - Toronto (0.14)

Genre:
- Research Report (0.40)

Industry:
- Leisure & Entertainment > Games (1.00)

Technology:
- Information Technology
  - Hardware (1.00)
  - Game Theory (1.00)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Agents (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found