EvoCoT: Overcoming the Exploration Bottleneck in Reinforcement Learning