Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models

Open in new window