A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
