Explaining and Mitigating Crosslingual Tokenizer Inequities

Open in new window