Back to Bytes: Revisiting Tokenization Through UTF-8