Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation

Open in new window