Compact Language Models via Pruning and Knowledge Distillation
