QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Open in new window