The Curse of Unrolling: Rate of Differentiating Through Optimization

Neural Information Processing Systems 

Computing the Jacobian of the solution of an optimization problem is a central problem in machine learning, with applications in hyperparameter optimization, meta-learning, optimization as a layer, and dataset distillation, to name a few.