A unified stochastic approximation framework for learning in games

Mertikopoulos, Panayotis, Hsieh, Ya-Ping, Cevher, Volkan

Jul-3-2023–arXiv.org Artificial Intelligence

We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential / multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In addition to providing an integrated view of these algorithms, our framework further allows us to obtain several new convergence results, both asymptotic and in finite time, in both continuous and finite games. Specifically, we provide a range of criteria for identifying classes of Nash equilibria and sets of action profiles that are attracting with high probability, and we also introduce the notion of coherence, a game-theoretic property that includes strict and sharp equilibria, and which leads to convergence in finite time. Importantly, our analysis applies to both oracle-based and bandit, payoff-based methods - that is, when players only observe their realized payoffs.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

Jul-3-2023

arXiv.org PDF

Add feedback

Country:
- Asia > Russia (0.04)
- North America > United States
  - Texas (0.04)
  - Rhode Island > Providence County
    - Providence (0.04)
  - New York > New York County
    - New York City (0.04)
- Europe
  - Russia (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
  - Switzerland
    - Zürich > Zürich (0.14)
    - Vaud > Lausanne (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)

Genre:
- Research Report (0.50)
- Workflow (0.45)

Technology:
- Information Technology
  - Mathematics of Computing (1.00)
  - Game Theory (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Optimization (0.92)
    - Machine Learning
      - Reinforcement Learning (0.67)
      - Statistical Learning > Gradient Descent (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found