Accurate Block Quantization in LLMs with Outliers
