GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
