HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices
Mecharbat, Lotfi Abdelkrim, Benmeziane, Hadjer, Ouarnoughi, Hamza, Niar, Smail
–arXiv.org Artificial Intelligence
Vision Transformers have enabled recent attention-based Deep Learning (DL) architectures to achieve remarkable results in Computer Vision (CV) tasks. However, due to the extensive computational resources required, these architectures are rarely implemented on resource-constrained platforms. Current research investigates hybrid handcrafted convolution-based and attention-based models for CV tasks such as image classification and object detection. In this paper, we propose HyT-NAS, an efficient Hardware-aware Neural Architecture Search (HW-NAS) including hybrid architectures targeting vision tasks on tiny devices. HyT-NAS improves state-of-the-art HW-NAS by enriching the search space and enhancing the search strategy as well as the performance predictors. Our experiments show that HyT-NAS achieves a similar hypervolume with less than ~5x training evaluations. Our resulting architecture outperforms MLPerf MobileNetV1 by 6.3% accuracy improvement with 3.5x less number of parameters on Visual Wake Words.
arXiv.org Artificial Intelligence
Mar-28-2023
- Country:
- North America
- United States
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Utah > Salt Lake County
- Canada > Quebec
- Montreal (0.05)
- United States
- Europe
- France > Hauts-de-France (0.04)
- Austria (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Asia > Middle East
- UAE > Sharjah Emirate > Sharjah (0.04)
- Africa > Middle East
- Algeria > Algiers Province > Algiers (0.04)
- North America
- Genre:
- Research Report (0.64)
- Technology: