Quantization of Large Language Models with an Overdetermined Basis

Open in new window