BiSup: Bidirectional Quantization Error Suppression for Large Language Models
