CBQ: Cross-Block Quantization for Large Language Models

Open in new window