Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
