What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation

Open in new window