BASE-Q: Bias and Asymmetric Scaling Enhanced Rotational Quantization for Large Language Models

Open in new window