Entropy-Driven Mixed-Precision Quantization for Deep Network Design
–Neural Information Processing Systems
Deploying deep convolutional neural networks on Internet-of-Things (IoT) devices is challenging due to the limited computational resources, such as limited SRAM memory and Flash storage. Previous works re-design a small network for IoT devices, and then compress the network size by mixed-precision quantization.
Neural Information Processing Systems
Dec-24-2025, 16:51:24 GMT
- Technology: