Adaptive Pruning of Pretrained Transformer via Differential Inclusions
