Sinkhorn Distance Minimization for Knowledge Distillation