Recursive Reasoning in Minimax Games: A Level k Gradient Play Method

Oct-11-2024, 11:51:26 GMT–Neural Information Processing Systems

Despite the success of generative adversarial networks (GANs) in generating visually appealing images, they are notoriously challenging to train. In order to stabilize the learning dynamics in minimax games, we propose a novel recursive reasoning algorithm: Level k Gradient Play (Lv. Our algorithm does not require sophisticated heuristics or second-order information, as do existing algorithms based on predictive updates. We show that as k increases, Lv. k GP converges asymptotically towards an accurate estimation of players' future strategy.Moreover, we justify that Lv. \infty GP naturally generalizes a line of provably convergent game dynamics which rely on predictive updates. Furthermore, we provide its local convergence property in nonconvex-nonconcave zero-sum games and global convergence in bilinear and quadratic games.

algorithm, gradient play method, recursive reasoning, (2 more...)

Neural Information Processing Systems

Oct-11-2024, 11:51:26 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Search (0.65)
  - Machine Learning > Neural Networks (0.43)