DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs
Neural Information Processing Systems
Quantization of large language models (LLMs) faces significant challenges, particularly due to the presence of outlier activations that impede efficient low-bit representation.
Nov-19-2025, 23:37:18 GMT
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (1.00)
- Industry:
- Government (0.45)
- Technology: