Semantic Tokenizer for Enhanced Natural Language Processing