Knowledge Distillation Performs Partial Variance Reduction