A Long N-step Surrogate Stage Reward for Deep Reinforcement Learning Junmin Zhong Arizona State University Ruofan Wu Arizona State University Jennie Si Arizona State University

Neural Information Processing Systems 

DDPG (TD3) [7], have demonstrated their great potential. Contributions . 1) We introduce a new, simple yet effective surrogate reward They usually proceed as follows.