Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks
