Reverse engineering learned optimizers reveals known and novel mechanisms

Jan-18-2025, 12:26:56 GMT–Neural Information Processing Systems

Learned optimizers are parametric algorithms that can themselves be trained to solve optimization problems. In contrast to baseline optimizers (such as momentum or Adam) that use simple update rules derived from theoretical principles, learned optimizers use flexible, high-dimensional, nonlinear parameterizations. Although this can lead to better performance, their inner workings remain a mystery. How is a given learned optimizer able to outperform a well tuned baseline? Has it learned a sophisticated combination of existing optimization techniques, or is it implementing completely new behavior?

novel mechanism, optimizer, optimizer reveal, (1 more...)

Neural Information Processing Systems

Jan-18-2025, 12:26:56 GMT

Conferences Web Page

Add feedback

Industry:
- Education > Curriculum > Subject-Specific Education (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.68)
  - Representation & Reasoning > Optimization (0.64)