Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head

Open in new window