FlatQuant: Flatness Matters for LLM Quantization
