Approximating Nash Equilibria in General-Sum Games via Meta-Learning
Sychrovský, David, Solinas, Christopher, MacQueen, Revan, Wang, Kevin, Wright, James R., Sturtevant, Nathan R., Bowling, Michael
–arXiv.org Artificial Intelligence
Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the problem of finding one in general-sum games is PPAD-complete, generally considered intractable. Regret minimization is an efficient framework for approximating Nash equilibria in two-player zero-sum games. However, in general-sum games, such algorithms are only guaranteed to converge to a coarse-correlated equilibrium (CCE), a solution concept where players can correlate their strategies. In this work, we use meta-learning to minimize the correlations in strategies produced by a regret minimizer. This encourages the regret minimizer to find strategies that are closer to a Nash equilibrium. The meta-learned regret minimizer is still guaranteed to converge to a CCE, but we give a bound on the distance to Nash equilibrium in terms of our meta-loss. We evaluate our approach in general-sum imperfect information games. Our algorithms provide significantly better approximations of Nash equilibria than state-of-the-art regret minimization techniques.
arXiv.org Artificial Intelligence
Apr-29-2025
- Country:
- Europe (0.67)
- North America
- Canada > Alberta (0.29)
- United States > California (0.46)
- Genre:
- Research Report (1.00)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Games > Poker (0.46)
- Machine Learning > Neural Networks (0.46)
- Game Theory (1.00)
- Artificial Intelligence
- Information Technology