Understanding and Accelerating Neural Architecture Search with Training-Free and Theory-Grounded Metrics

Chen, Wuyang, Gong, Xinyu, Wu, Junru, Wei, Yunchao, Shi, Humphrey, Yan, Zhicheng, Yang, Yi, Wang, Zhangyang

Dec-29-2022–arXiv.org Artificial Intelligence

NAS has been explosively studied to automate the discovery of top-performer neural networks, but suffers from heavy resource consumption and often incurs search bias due to truncated training or approximations. Recent NAS works [1], [2], [3] start to explore indicators that can predict a network's performance without training. However, they either leveraged limited properties of deep networks, or the benefits of their training-free indicators are not applied to more extensive search methods. By rigorous correlation analysis, we present a unified framework to understand and accelerate NAS, by disentangling "TEG" characteristics of searched networks - Trainability, Expressivity, Generalization - all assessed in a training-free manner. The TEG indicators could be scaled up and integrated with various NAS search methods, including both supernet and single-path NAS approaches. Extensive studies validate the effective and efficient guidance from our TEG-NAS framework, leading to both improved search accuracy and over 56% reduction in search time cost. Moreover, we visualize search trajectories on three landscapes of "TEG" characteristics, observing that a good local minimum is easier to find on NAS-Bench-201 given its simple topology, whereas balancing "TEG" characteristics is much harder on the DARTS space due to its complex landscape geometry.

architecture, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Dec-29-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States > Texas (0.28)

Genre:
- Research Report (0.64)

Industry:
- Materials > Chemicals
  - Industrial Gases > Liquified Gas (0.67)
  - Commodity Chemicals > Petrochemicals
    - LNG (0.67)
- Energy > Oil & Gas
  - Midstream (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Search (1.00)
  - Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found