Review for NeurIPS paper: Training Stronger Baselines for Learning to Optimize

Neural Information Processing Systems 

Summary and Contributions: This paper proposes changes to L2O to make a generic L2O algorithm easier and faster to train using common techniques like curriculum and imitation learning. They show their method give significant improvements across many existing evaluation criteria and methods. EDIT after rebuttal: Thank you very much for clarifying our concerns. I really appreciate running the additional experiments we requested and will increase my score. As an extra suggestion, please include the validation/test loss plots in your paper as opposed to training.