SmoothQuant+: Accurate and Efficient 4-bit Post-Training Weight Quantization for LLM
