R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization

Open in new window