KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer

Al-Qawlaq, Aness, M, Ajay Kumar, John, Deepu

Jul-22-2024–arXiv.org Artificial Intelligence

University College Dublin, Ireland Abstract -- This paper explores the adaptation of Transformer - based models for edge devices through the quantis ation and hardware acceleration of the ARM Keyword Transformer (KWT) model on a RISC - V platform. The model was targeted to run on 64kB RAM in bare - metal C using a custom - developed edge AI library. KWT - 1 was retrained to be 369 times smaller, with only a 10 % loss in accuracy through reducing output classes from 35 to 2. The retraining and quantis ation reduced model size from 2.42 MB to 1.65 kB. The integration of custom RISC - V instructions that accelerated GELU and SoftMax operations enabled a 5x speedup and thus ~5x power reduction in inference, with inference clock cycle counts decreasing from 26 million to 5.5 million clock cycles while incurring a small area overhead of approximately 29 % . The results demonstrate a viable method for porting and accelerating Transformer - based models in low - power IoT devices.

kwt, opération, transformer, (17 more...)

arXiv.org Artificial Intelligence

Jul-22-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > United States
  - Virginia (0.04)
  - California > San Diego County
    - San Diego (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.24)
- Asia > Japan
  - Kyūshū & Okinawa > Kyūshū > Fukuoka Prefecture > Fukuoka (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.88)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found