Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge