Optimizing Energy Efficiency in Metro Systems Under Uncertainty Disturbances Using Reinforcement Learning