Tokenizer Choice For LLM Training: Negligible or Crucial?