Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting
–arXiv.org Artificial Intelligence
In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror descent (MD) dynamics, referred to as MD2, which provably converges to mere (but not necessarily strict) variationally stable states (VSS) without using common auxiliary techniques such as time-averaging or discounting. We show that MD2 enjoys no-regret as well as an exponential rate of convergence towards strong VSS upon a slight modification. MD2 can also be used to derive many novel continuous-time primal-space dynamics. We then use stochastic approximation techniques to provide a convergence guarantee of discrete-time MD2 with noisy observations towards interior mere VSS. Selected simulations are provided to illustrate our results.
arXiv.org Artificial Intelligence
Jun-30-2023
- Country:
- Asia
- Europe
- Germany > Berlin (0.04)
- Romania > Nord-Est Development Region
- Iași County > Iași (0.04)
- Russia (0.04)
- Switzerland (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Ontario
- Toronto (0.14)
- United States > Massachusetts
- Middlesex County > Cambridge (0.04)
- Canada > Ontario
- Genre:
- Research Report > New Finding (0.65)
- Industry:
- Leisure & Entertainment > Games (0.68)
- Technology: