RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models

Open in new window