A Long N-step Surrogate Stage Reward for Deep Reinforcement Learning Junmin Zhong Arizona State University Ruofan Wu Arizona State University Jennie Si Arizona State University