Single Agent Robust Deep Reinforcement Learning for Bus Fleet Control