CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs
