QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Neural Information Processing Systems 

Code is available at github.com/spcl/QuaRot .