A Additional Related Work KD has been extensively applied to computer vision and NLP tasks [ 52