Robust Quantization: One Model to Rule Them All

Shkolnik, Moran, Chmiel, Brian, Banner, Ron, Shomron, Gil, Nahshan, Yuri, Bronstein, Alex, Weiser, Uri

Feb-18-2020–arXiv.org Machine Learning

Neural network quantization methods often involve simulating the quantization process during training. This makes the trained model highly dependent on the precise way quantization is performed. Since low-precision accelerators differ in their quantization policies and their supported mix of data-types, a model trained for one accelerator may not be suitable for another. To address this issue, we propose KURE, a method that provides intrinsic robustness to the model against a broad range of quantization implementations. We show that KURE yields a generic model that may be deployed on numerous inference accelerators without a significant loss in accuracy.

artificial intelligence, machine learning, quantization, (16 more...)

arXiv.org Machine Learning

Feb-18-2020

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East > Israel (0.04)

Genre:
- Research Report (0.51)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found