QTIP: Quantization with Trellises and Incoherence Processing

Feb-15-2026, 16:31:27 GMT–Neural Information Processing Systems

Post-training quantization (PTQ) reduces the memory footprint of LLMs by quan-tizing weights to low-precision datatypes. Since LLM inference is usually memory-bound, PTQ methods can improve inference throughput.

large language model, machine learning, quantization, (21 more...)

Neural Information Processing Systems

Feb-15-2026, 16:31:27 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.04)
- Europe > United Kingdom
  - England (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning (1.00)
  - Representation & Reasoning (0.93)

Duplicate Docs Excel Report

Title
6de2e84b8da47bb2eb5e2ac96c63d2b0-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found