Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models Xiuying Wei

Open in new window