Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge

Open in new window