Towards Efficient Pre-Trained Language Model via Feature Correlation Distillation
–Neural Information Processing Systems
Therefore, a series of attempts Chung et al. [2020], Wu et al. [2020], Wang et al. [2020c], Gordon et al. [2020a], Tang et al. [2019], Aguilar et al. [2019] have been made to review the techniques for effective
Neural Information Processing Systems
Feb-9-2026, 23:14:52 GMT