LCQ: Low-Rank Codebook based Quantization for Large Language Models
