QuantEase: Optimization-based Quantization for Language Models
