Speedy Performance Estimation for Neural Architecture Search

Oct-9-2024, 18:06:01 GMT–Neural Information Processing Systems

Reliable yet efficient evaluation of generalisation performance of a proposed architecture is crucial to the success of neural architecture search (NAS). Traditional approaches face a variety of limitations: training each architecture to completion is prohibitively expensive, early stopped validation accuracy may correlate poorly with fully trained performance, and model-based estimators require large training sets. We instead propose to estimate the final test performance based on a simple measure of training speed. Our estimator is theoretically motivated by the connection between generalisation and training speed, and is also inspired by the reformulation of a PAC-Bayes bound under the Bayesian setting. Our model-free estimator is simple, efficient, and cheap to implement, and does not require hyperparameter-tuning or surrogate training before deployment.

estimator, neural architecture search, speedy performance estimation, (2 more...)

Neural Information Processing Systems

Oct-9-2024, 18:06:01 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science (0.79)
  - Systems & Languages > Problem-Independent Architectures (0.65)
  - Machine Learning > Neural Networks (0.65)