RepQuant: Towards Accurate Post-Training Quantization of Large Transformer Models via Scale Reparameterization

Open in new window