A Single Character can Make or Break Your LLM Evals