ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization
