Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization

Neural Information Processing Systems 

U, which achieves an unbiased stochastic approximation of the meta gradient for bi-level optimization.