GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale
