Accelerator-aware Neural Network Design using AutoML

Mar-5-2020–arXiv.org Machine Learning

While neural network hardware accelerators provide substantial raw compute throughput, the models deployed on them must be co-designed for the underlying hardware architecture to obtain optimal system performance. We present a class of computer vision models designed using hardware-aware neural architecture search and customized to run on the Edge TPU, Google's neural network hardware accelerator for low-power edge devices. For the Edge TPU in Coral devices, these models enable real-time image classification while achieving accuracy typically seen only with larger, compute-heavy models running in data centers. On Pixel 4's Edge TPU, these models improve the accuracy-latency tradeoff over existing state-of-the-art mobile models.
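
To make the phrase "accuracy-latency tradeoff" concrete, here is a minimal sketch of how one might pick the non-dominated models from a set of candidates. It is not the search procedure from the paper, which the abstract does not describe. The `Candidate` type, the `pareto_front` helper, the model names, and all numbers are hypothetical and exist only for illustration.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    name: str          # hypothetical model identifier
    accuracy: float    # top-1 accuracy in [0, 1] (made-up value)
    latency_ms: float  # on-device latency in milliseconds (made-up value)


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Return candidates that no other candidate beats on both axes."""
    front = []
    best_accuracy = float("-inf")
    # Visit models from fastest to slowest; on a latency tie, the more
    # accurate model comes first.
    for c in sorted(candidates, key=lambda c: (c.latency_ms, -c.accuracy)):
        # A slower model is only worth keeping if it is strictly more
        # accurate than every faster model seen so far.
        if c.accuracy > best_accuracy:
            front.append(c)
            best_accuracy = c.accuracy
    return front


# Illustrative, invented measurements; not results from the paper.
models = [
    Candidate("model_a", 0.70, 5.0),
    Candidate("model_b", 0.68, 6.0),   # slower and less accurate than model_a
    Candidate("model_c", 0.75, 8.0),
    Candidate("model_d", 0.75, 9.0),   # same accuracy as model_c but slower
    Candidate("model_e", 0.77, 12.0),
]
front = pareto_front(models)
front_names = [c.name for c in front]
```

Under this reading, a new family of models improves the tradeoff when its points push the front toward higher accuracy at the same latency, or lower latency at the same accuracy.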

architecture, latency, search space

Country:
- North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre:
- Research Report (0.40)

Industry:
- Information Technology (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
