MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
