LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers λ, Mike Lewis, Luke Zettlemoyer λ
