MergeQuant: Accurate 4-bit Static Quantization of Large Language Models by Channel-wise Calibration

Open in new window