REx: Data-Free Residual Quantization Error Expansion

Neural Information Processing Systems

Deep neural networks (DNNs) are ubiquitous in computer vision and natural language processing, but suffer from high inference cost.