Finding Second-Order Stationary Points in Nonconvex-Strongly-Concave Minimax Optimization

Jan-19-2025, 06:02:34 GMT–Neural Information Processing Systems

Most of existing works focus on finding the first-order stationary point of the function f({\bf x},{\bf y}) or its primal function P({\bf x})\triangleq \max_{\bf y} f({\bf x},{\bf y}), but few of them focus on achieving the second-order stationary point, which is essential to nonconvex problems. In addition, we propose an inexact variant of MCN for high-dimensional problems to avoid calling the expensive second-order oracles. Instead, our method solves the cubic sub-problem inexactly via gradient descent and matrix Chebyshev expansion. This strategy still obtains the desired approximate second-order stationary point with high probability but only requires \tilde{\mathcal O}\left(\kappa {1.5}\ell\varepsilon {-2}\right) Hessian-vector oracle calls and \tilde{\mathcal O}\left(\kappa {2}\sqrt{\rho}\varepsilon {-1.5}\right) first-order oracle calls. To the best of our knowledge, this is the first work that considers the non-asymptotic convergence behavior of finding second-order stationary points for minimax problems without the convex-concave assumptions.

nonconvex-strongly-concave minimax optimization, stationary point, varepsilon, (4 more...)

Neural Information Processing Systems

Jan-19-2025, 06:02:34 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.72)