A Gradient Method for Multilevel Optimization

Neural Information Processing Systems 

In machine learning, Franceschi et al. recently proposed a method for solving bilevel optimization problems by replacing their lower-level problems with the