Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search

Jan-19-2025, 00:32:16 GMT–Neural Information Processing Systems

Neural architecture search (NAS) has gained immense popularity owing to its ability to automate neural architecture design. A number of training-free metrics are recently proposed to realize NAS without training, hence making NAS more scalable. Despite their competitive empirical performances, a unified theoretical understanding of these training-free metrics is lacking. As a consequence, (a) the relationships among these metrics are unclear, (b) there is no theoretical interpretation for their empirical performances, and (c) there may exist untapped potential in existing training-free NAS, which probably can be unveiled through a unified theoretical understanding. To this end, this paper presents a unified theoretical analysis of gradient-based training-free NAS, which allows us to (a) theoretically study their relationships, (b) theoretically guarantee their generalization performances, and (c) exploit our unified theoretical understanding to develop a novel framework named hybrid NAS (HNAS) which consistently boosts training-free NAS in a principled way.

artificial intelligence, gradient-based training-free neural architecture search, machine learning, (2 more...)

Neural Information Processing Systems

Jan-19-2025, 00:32:16 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Systems & Languages > Problem-Independent Architectures (0.90)
  - Machine Learning > Neural Networks (0.90)
  - Cognitive Science (0.90)