Provable Acceleration of Nesterov's Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networks

Neural Information Processing Systems 

We study the convergence rate of first-order methods for rectangular matrix factorization, which is a canonical nonconvex optimization problem.