Stable Nonconvex-Nonconcave Training via Linear Interpolation

Neural Information Processing Systems 

By replacing the inner optimizer in RAPP, we rediscover the family of Lookahead algorithms, for which we establish convergence in cohypomonotone problems even when the base optimizer is gradient descent ascent.
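
To make the Lookahead mechanism concrete, here is a minimal sketch of linear interpolation wrapped around gradient descent ascent. It is an illustration under stated assumptions, not the paper's algorithm or experimental setup: the toy objective f(x, y) = x * y is a bilinear (monotone) game rather than a general cohypomonotone problem, and the step size `lr`, inner step count `k`, and interpolation weight `alpha` are hypothetical values chosen only for this example. On this game plain simultaneous gradient descent ascent spirals away from the solution (0, 0), while averaging each k-step rollout back toward its starting point contracts toward it.

```python
import math


def gda_step(x, y, lr):
    # Simultaneous gradient descent ascent on the toy game f(x, y) = x * y:
    # x descends along df/dx = y, y ascends along df/dy = x.
    return x - lr * y, y + lr * x


def run_gda(x, y, lr, steps):
    # Plain base optimizer: repeat the GDA step.
    for _ in range(steps):
        x, y = gda_step(x, y, lr)
    return x, y


def run_lookahead(x, y, lr, k, alpha, outer_steps):
    # Lookahead: roll the base optimizer forward k steps from the slow
    # iterate, then move the slow iterate a fraction alpha of the way
    # toward the rollout endpoint (linear interpolation).
    for _ in range(outer_steps):
        fast_x, fast_y = run_gda(x, y, lr, k)
        x = x + alpha * (fast_x - x)
        y = y + alpha * (fast_y - y)
    return x, y


def norm(point):
    # Distance from the solution (0, 0) of the toy game.
    return math.hypot(point[0], point[1])


# Hypothetical settings for illustration only.
start = (1.0, 1.0)
gda_final = run_gda(*start, lr=0.1, steps=2500)
la_final = run_lookahead(*start, lr=0.1, k=5, alpha=0.5, outer_steps=500)

print("GDA distance to solution:      ", norm(gda_final))
print("Lookahead distance to solution:", norm(la_final))
```

Both runs use the same total number of base steps (2500), so the difference in behavior comes only from the interpolation step. For this linear example the effect can be checked by hand: each GDA step multiplies the distance to the solution by sqrt(1 + lr^2) > 1, whereas the interpolated k-step map has spectral radius below 1 for the values chosen here.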