ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
