Large scale distributed neural network training through online distillation

Open in new window