Pruning and Quantization for Deep Neural Network Acceleration: A Survey

Open in new window