Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes

Neural Information Processing Systems 

Quantization is a popular technique that transforms the parameter representation of a neural network from floating-point numbers into lower-precision ones (e.g., 8-bit integers). It reduces the memory footprint and the computational cost at inference, facilitating the deployment of resource-hungry models. However, the parameter perturbations caused by this transformation result in behavioral disparities between the model before and after quantization. For example, a quantized model can misclassify some test-time samples that are otherwise classified correctly. It is not known whether such differences lead to a new security vulnerability.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found