Sinkhorn Distance Minimization for Knowledge Distillation

Open in new window