SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
