ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization

Open in new window