3 deep learning mysteries: Ensemble, knowledge- and self-distillation