BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning

Open in new window