Towards Efficient Pre-Trained Language Model via Feature Correlation Distillation

Neural Information Processing Systems 

Therefore, a series of attempts Chung et al. [2020], Wu et al. [2020], Wang et al. [2020c], Gordon et al. [2020a], Tang et al. [2019], Aguilar et al. [2019] have been made to review the techniques for effective
