FP8-BERT: Post-Training Quantization for Transformer

Open in new window