Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics

Patel, Deep, Vlatakis-Gkaragkounis, Emmanouil-Vasileios

Dec-2-2025–arXiv.org Machine Learning

Many emerging applications - such as adversarial training, AI alignment, and robust optimization - can be framed as zero-sum games between neural nets, with von Neumann-Nash equilibria (NE) capturing the desirable system behavior. While such games often involve non-convex non-concave objectives, empirical evidence shows that simple gradient methods frequently converge, suggesting a hidden geometric structure. In this paper, we provide a theoretical framework that explains this phenomenon through the lens of hidden convexity and overparameterization. We identify sufficient conditions - spanning initialization, training dynamics, and network width - that guarantee global convergence to a NE in a broad class of non-convex min-max games. To our knowledge, this is the first such result for games that involve two-layer neural networks. Technically, our approach is twofold: (a) we derive a novel path-length bound for the alternating gradient descent-ascent scheme in min-max games; and (b) we show that the reduction from a hidden convex-concave geometry to two-sided Polyak-Łojasiewicz (PŁ) min-max condition hold with high probability under overparameterization, using tools from random matrix theory.

neural network, optimization, overparameterization, (16 more...)

arXiv.org Machine Learning

Dec-2-2025

arXiv.org PDF

Add feedback

Country:
- Europe
  - Sweden > Stockholm
    - Stockholm (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- North America > United States
  - Texas (0.04)
  - Wisconsin > Dane County
    - Madison (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (1.00)

Industry:
- Government (1.00)
- Information Technology (0.92)
- Leisure & Entertainment > Games (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks (1.00)
      - Statistical Learning > Gradient Descent (0.48)
    - Representation & Reasoning
      - Agents (1.00)
      - Optimization (1.00)
  - Game Theory (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found