A Long N-step Surrogate Stage Reward for Deep Reinforcement Learning
Junmin Zhong (Arizona State University), Ruofan Wu (Arizona State University), Jennie Si (Arizona State University)
–Neural Information Processing Systems
... DDPG (TD3) [7], have demonstrated their great potential. ... Contributions. 1) We introduce a new, simple yet effective surrogate reward ... They usually proceed as follows. ...
Oct-8-2025, 08:19:09 GMT
- Country:
  - Asia > Middle East
    - Jordan (0.04)
  - North America > United States
    - Arizona (1.00)
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Leisure & Entertainment > Games (0.93)
- Technology: