QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models
