SingleQuant: Efficient Quantization of Large Language Models in a Single Pass
