CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs

Open in new window