Goto

Collaborating Authors

 Optimization


Shadowing Properties of Optimization Algorithms

Neural Information Processing Systems

Analyzing the convergence properties of these algorithms can be complex, especially for NAG whose convergence proof relies on algebraic tricks that reveal little detail about the acceleration phenomenon, i.e. the celebrated optimality of NAG in convex smooth optimization. Instead, an alternative approach is to view these methods as numerical integrators of some ordinary differential equations (ODEs).










Direct Runge-Kutta Discretization Achieves Acceleration

Neural Information Processing Systems

We study gradient-based optimization methods obtained by directly discretizing a second-order ordinary differential equation (ODE) related to the continuous limit of Nesterov's accelerated gradient method.