Supplementary Materials: Training Stronger Baselines for Learning to Optimize Tianlong Chen
–Neural Information Processing Systems
L2O-DM-CL donates the enhanced L2O-DM with our proposed curriculum learning technique. All learnable optimizers are trained with 5000 epochs. The results are presented in figure A2. We observe that the model trained by curriculum learning outperforms the two baselines (i.e., L2O-DM and L2O-DM-AUG) with Curves are the average of ten runs. Evaluation performance of our enhanced L2O and previous SOT As (i.e., log training loss v.s.
Neural Information Processing Systems
Oct-2-2025, 22:28:43 GMT
- Country:
- Asia > China
- North America
- Canada > Ontario
- Toronto (0.05)
- United States > Texas
- Travis County > Austin (0.05)
- Canada > Ontario
- Genre:
- Research Report > New Finding (0.36)
- Technology: