VeLO: Training Versatile Learned Optimizers by Scaling Up

Open in new window