NAT: Neural Architecture Transformer for Accurate and Compact Architectures

Guo, Yong, Zheng, Yin, Tan, Mingkui, Chen, Qi, Chen, Jian, Zhao, Peilin, Huang, Junzhou

Neural Information Processing Systems 

Designing effective architectures is one of the key factors behind the success of deep neural networks. Existing deep architectures are either manually designed or automatically searched by some Neural Architecture Search (NAS) methods. However, even a well-searched architecture may still contain many non-significant or redundant modules or operations (e.g., convolution or pooling), which may not only incur substantial memory consumption and computation cost but also deteriorate the performance. Thus, it is necessary to optimize the operations inside an architecture to improve the performance without introducing extra computation cost. Unfortunately, such a constrained optimization problem is NP-hard.