On Importance of Layer Pruning for Smaller BERT Models and Low Resource Languages

Open in new window