CPTQuant - A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models
