Towards Efficient In-memory Computing Hardware for Quantized Neural Networks: State-of-the-art, Open Challenges and Perspectives

Krestinskaya, Olga, Zhang, Li, Salama, Khaled Nabil

Jul-8-2023–arXiv.org Artificial Intelligence

The amount of data processed in the cloud, the development of Internet-of-Things (IoT) applications, and growing data privacy concerns force the transition from cloud-based to edge-based processing. Limited energy and computational resources on edge push the transition from traditional von Neumann architectures to In-memory Computing (IMC), especially for machine learning and neural network applications. Network compression techniques are applied to implement a neural network on limited hardware resources. Quantization is one of the most efficient network compression techniques allowing to reduce the memory footprint, latency, and energy consumption. This paper provides a comprehensive review of IMC-based Quantized Neural Networks (QNN) and links software-based quantization approaches to IMC hardware implementation. Moreover, open challenges, QNN design requirements, recommendations, and perspectives along with an IMC-based QNN hardware roadmap are provided.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Jul-8-2023

arXiv.org PDF

Add feedback

Country:
- Europe (0.04)
- North America > United States
  - New York > New York County > New York City (0.04)
- Asia > Middle East
  - Saudi Arabia (0.04)

Genre:
- Overview (0.86)

Industry:
- Information Technology > Security & Privacy (0.88)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found