Fast Matrix Multiplications for Lookup Table-Quantized LLMs
