On the Unreasonable Effectiveness of Knowledge Distillation: Analysis in the Kernel Regime

Open in new window