SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models

Open in new window