MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights

Nguyen, Van Thien, Guicquero, William, Sicard, Gilles

Jan-16-2025–arXiv.org Artificial Intelligence

This paper presents a compact model architecture called MOGNET, compatible with a resource-limited hardware. MOGNET uses a streamlined Convolutional factorization block based on a combination of 2 point-wise (1x1) convolutions with a group-wise convolution in-between. To further limit the overall model size and reduce the on-chip required memory, the second point-wise convolution's parameters are on-line generated by a Cellular Automaton structure. In addition, MOGNET enables the use of low-precision weights and activations, by taking advantage of a Multiplexer mechanism with a proper Bitshift rescaling for integrating residual paths without increasing the hardware-related complexity. To efficiently train this model we also introduce a novel weight ternarization method favoring the balance between quantized levels. Experimental results show that given tiny memory budget (sub-2Mb), MOGNET can achieve higher accuracy with a clear gap up to 1% at a similar or even lower model size compared to recent state-of-the-art methods.

activation, convolution, neural network, (16 more...)

arXiv.org Artificial Intelligence

Jan-16-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.04)
- Europe > France
  - Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found