Iterative Layer Pruning for Efficient Translation Inference

Open in new window