MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling

Open in new window