Accelerating Reinforcement Learning Training Using Simulation Surrogate Models