CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs

Open in new window