Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting

Jun-30-2023–arXiv.org Artificial Intelligence

In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror descent (MD) dynamics, referred to as MD2, which provably converges to mere (but not necessarily strict) variationally stable states (VSS) without relying on common auxiliary techniques such as time-averaging or discounting. We show that MD2 is no-regret and that, with a slight modification, it converges to strong VSS at an exponential rate. MD2 can also be used to derive many novel continuous-time primal-space dynamics. We then use stochastic approximation techniques to provide a convergence guarantee for discrete-time MD2 with noisy observations towards interior mere VSS. Selected simulations illustrate our results.
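
To make the role of time-averaging concrete, the following is a minimal sketch of the standard first-order entropic MD dynamics (not MD2, whose equations are not reproduced in this listing) on matching pennies. The game, the step size, the starting point, and all function names are illustrative assumptions rather than anything taken from the paper. Under these assumptions, the last iterate does not approach the interior equilibrium, while the time-averaged strategies do.

```python
import math

# Matching pennies (an assumed toy game): player 1 receives A[i][j],
# player 2 receives -A[i][j]. The unique Nash equilibrium is uniform play.
A = [[1.0, -1.0], [-1.0, 1.0]]
UNIFORM = [0.5, 0.5]


def softmax(z):
    """Mirror map induced by the entropy regularizer (dual scores -> strategy)."""
    m = max(z)
    w = [math.exp(v - m) for v in z]
    s = sum(w)
    return [v / s for v in w]


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q), assuming q is strictly positive."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def run_first_order_md(steps=20000, eta=0.01):
    """Euler discretization of dz/dt = payoff vector, x = softmax(z).

    Returns the KL distance to equilibrium at the start and at the end,
    together with the time-averaged strategies of both players.
    """
    z1, z2 = [1.0, 0.0], [0.0, 0.0]  # arbitrary non-equilibrium start
    avg1, avg2 = [0.0, 0.0], [0.0, 0.0]
    x, y = softmax(z1), softmax(z2)
    initial_gap = kl(UNIFORM, x) + kl(UNIFORM, y)
    for _ in range(steps):
        x, y = softmax(z1), softmax(z2)
        # Running time average of the played strategies.
        avg1 = [a + xi / steps for a, xi in zip(avg1, x)]
        avg2 = [a + yi / steps for a, yi in zip(avg2, y)]
        # Payoff vectors: u1 = A y for player 1, u2 = -A^T x for player 2.
        u1 = [sum(A[i][j] * y[j] for j in range(2)) for i in range(2)]
        u2 = [-sum(A[i][j] * x[i] for i in range(2)) for j in range(2)]
        # First-order MD step in the dual (score) space.
        z1 = [z + eta * u for z, u in zip(z1, u1)]
        z2 = [z + eta * u for z, u in zip(z2, u2)]
    final_gap = kl(UNIFORM, softmax(z1)) + kl(UNIFORM, softmax(z2))
    return initial_gap, final_gap, avg1, avg2


if __name__ == "__main__":
    initial_gap, final_gap, avg1, avg2 = run_first_order_md()
    print("KL distance to equilibrium, start vs. end:", initial_gap, final_gap)
    print("time-averaged strategies:", avg1, avg2)
```

In this sketch the KL distance of the last iterate to the equilibrium never shrinks, whereas the averaged strategies end up close to uniform play. This is the kind of behavior that motivates averaging or discounting in first-order schemes, and that the abstract says MD2 avoids.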

artificial intelligence, converge, machine learning

Country:
- North America
  - United States > Massachusetts
    - Middlesex County > Cambridge (0.04)
  - Canada > Ontario
    - Toronto (0.14)
- Europe
  - Switzerland (0.04)
  - Russia (0.04)
  - Germany > Berlin (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Romania > Nord-Est Development Region
    - Iași County > Iași (0.04)
- Asia
  - Russia (0.04)
  - Japan (0.04)

Genre:
- Research Report > New Finding (0.65)

Industry:
- Leisure & Entertainment > Games (0.68)

Technology:
- Information Technology
  - Game Theory (1.00)
  - Mathematics of Computing (0.87)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Agents (0.46)
