Neural Network Quantization for Efficient Inference: A Survey

Open in new window