Entropy-Driven Pre-Tokenization for Byte-Pair Encoding