SKDBERT: Compressing BERT via Stochastic Knowledge Distillation

Open in new window