Near-Optimal No-Regret Learning in General Games

Daskalakis, Constantinos, Fishelson, Maxwell, Golowich, Noah

Jan-24-2023–arXiv.org Artificial Intelligence

We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bias -- attains ${\rm poly}(\log T)$ regret in multi-player general-sum games. In particular, when every player of the game uses Optimistic Hedge to iteratively update her strategy in response to the history of play so far, then after $T$ rounds of interaction, each player experiences total regret that is ${\rm poly}(\log T)$. Our bound improves, exponentially, the $O({T}^{1/2})$ regret attainable by standard no-regret learners in games, the $O(T^{1/4})$ regret attainable by no-regret learners with recency bias (Syrgkanis et al., 2015), and the ${O}(T^{1/6})$ bound that was recently shown for Optimistic Hedge in the special case of two-player games (Chen & Pen, 2020). A corollary of our bound is that Optimistic Hedge converges to coarse correlated equilibrium in general games at a rate of $\tilde{O}\left(\frac 1T\right)$.

artificial intelligence, lemma 4, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Jan-24-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - New York
      - New York County > New York City (0.04)
      - Tompkins County > Ithaca (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - California
      - San Diego County > San Diego (0.04)
      - Los Angeles County > Santa Monica (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.40)

Industry:
- Leisure & Entertainment > Games (0.68)

Technology:
- Information Technology
  - Game Theory (1.00)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found