QuIP: 2-Bit Quantization of Large Language Models With Guarantees
–Neural Information Processing Systems
We introduce quantization with incoherence processing (QuIP), a new method based on the insight that quantization benefits from incoherent weight and Hessian matrices, i.e., from the weights being even in magnitude and the
Neural Information Processing Systems
Oct-8-2025, 03:06:51 GMT
- Country:
- Europe > Germany
- Berlin (0.04)
- North America > United States
- California > San Diego County
- San Diego (0.04)
- New Jersey (0.04)
- California > San Diego County
- Oceania > Australia
- Europe > Germany
- Technology: