Stable Nonconvex-Nonconcave Training via Linear Interpolation
Neural Information Processing Systems
By replacing the inner optimizer in RAPP, we rediscover the family of Lookahead algorithms, for which we establish convergence in cohypomonotone problems even when the base optimizer is gradient descent ascent.
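To make the Lookahead idea concrete, here is a minimal sketch on a toy problem. It is an illustration under stated assumptions, not the paper's algorithm or experimental setup: the objective f(x, y) = x * y, the step size, the number of inner steps k, and the interpolation weight alpha are all chosen here for demonstration, and the function names are hypothetical. Lookahead runs k steps of the base optimizer (here gradient descent ascent) and then linearly interpolates between the starting point and the result.

```python
import math


def gda_step(x, y, lr):
    """One simultaneous gradient descent ascent step on f(x, y) = x * y."""
    grad_x = y  # df/dx
    grad_y = x  # df/dy
    # Descend in x (the min player), ascend in y (the max player).
    return x - lr * grad_x, y + lr * grad_y


def run_gda(x, y, lr, steps):
    """Plain gradient descent ascent for a fixed number of steps."""
    for _ in range(steps):
        x, y = gda_step(x, y, lr)
    return x, y


def run_lookahead(x, y, lr, k, alpha, outer_steps):
    """Lookahead with GDA as the base optimizer (toy sketch).

    Each outer step runs k inner GDA steps from the current slow iterate,
    then moves the slow iterate a fraction alpha toward the result.
    """
    for _ in range(outer_steps):
        fast_x, fast_y = run_gda(x, y, lr, k)
        # Linear interpolation between slow and fast iterates.
        x = (1 - alpha) * x + alpha * fast_x
        y = (1 - alpha) * y + alpha * fast_y
    return x, y


def norm(point):
    """Euclidean distance of (x, y) from the solution at the origin."""
    return math.hypot(*point)


if __name__ == "__main__":
    # Illustrative hyperparameters (assumptions, not taken from the paper).
    start = (1.0, 1.0)
    plain = run_gda(*start, lr=0.1, steps=2500)
    looked = run_lookahead(*start, lr=0.1, k=5, alpha=0.5, outer_steps=500)
    print("GDA distance to solution:      ", norm(plain))
    print("Lookahead distance to solution:", norm(looked))
```

Both runs use the same total number of gradient evaluations (2500). On this bilinear toy problem, plain gradient descent ascent spirals away from the solution at the origin, while the interpolated iterate contracts toward it for these particular settings; other choices of k and alpha may behave differently.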
Oct-9-2025, 02:37:55 GMT