Solving Games with Functional Regret Estimation
Waugh, Kevin (Carnegie Mellon University) | Morrill, Dustin (University of Alberta) | Bagnell, James Andrew (Carnegie Mellon University) | Bowling, Michael (University of Alberta)
We propose a novel online learning method for minimizing regret in large extensive-form games. The approach learns a function approximator online to estimate the regret for choosing a particular action. A no-regret algorithm uses these estimates in place of the true regrets to define a sequence of policies. We prove the approach sound by providing a bound relating the quality of the function approximation to the regret of the algorithm. A corollary is that the method is guaranteed to converge to a Nash equilibrium in self-play so long as the regrets are ultimately realizable by the function approximator. Our technique can be understood as a principled generalization of existing work on abstraction in large games; in our work, both the abstraction and the equilibrium are learned during self-play. We demonstrate empirically that the method achieves higher-quality strategies than state-of-the-art abstraction techniques given the same resources.
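To make the core idea concrete, here is a minimal sketch (not the authors' implementation) of how estimated regrets could stand in for true regrets: a regressor is trained on observed (information-set features, action regret) pairs, and regret matching over its predictions defines the current policy. The names `RegretEstimator`, `policy_from_estimates`, and the feature encoding are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import Ridge


class RegretEstimator:
    """Fits a function approximator to observed (features, regret) pairs."""

    def __init__(self):
        self.model = Ridge(alpha=1.0)
        self.X, self.y = [], []

    def update(self, features, regrets):
        # Accumulate one training example per (information set, action),
        # then refit the regressor on all observations so far.
        for action, regret in enumerate(regrets):
            self.X.append(np.append(features, action))
            self.y.append(regret)
        self.model.fit(np.array(self.X), np.array(self.y))

    def estimate(self, features, num_actions):
        # Predict the regret of each action at this information set.
        queries = np.array([np.append(features, a) for a in range(num_actions)])
        return self.model.predict(queries)


def policy_from_estimates(estimated_regrets):
    """Regret matching applied to estimated (rather than true) regrets."""
    positive = np.maximum(estimated_regrets, 0.0)
    total = positive.sum()
    if total <= 0.0:
        # No positive estimated regret: fall back to the uniform policy.
        return np.full(len(estimated_regrets), 1.0 / len(estimated_regrets))
    return positive / total
```

In self-play, each iteration would query `estimate` at every information set reached, act according to `policy_from_estimates`, and feed the newly observed regrets back through `update`; the paper's bound ties the resulting regret to how well the approximator can represent the true regrets.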
Mar-1-2015