Consistent Accelerated Inference via Confident Adaptive Transformers

Open in new window