DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models
