AMiD: Knowledge Distillation for LLMs with $α$-mixture Assistant Distribution

Open in new window