Towards Efficient Post-training Quantization of Pre-trained Language Models

Neural Information Processing Systems 

Post-training quantization (PTQ), on the other hand, serves as an appealing alternative that is efficient in training time, memory overhead and data consumption.
