SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks

Open in new window