Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping Hao Sun 1 Lei Han 2 Rui Y ang
–Neural Information Processing Systems
In this work, we study the simple yet universally applicable case of reward shaping in value-based Deep Reinforcement Learning (DRL).
Neural Information Processing Systems
Aug-19-2025, 20:20:01 GMT
- Country:
- Asia
- China > Hong Kong (0.04)
- Japan > Honshū
- Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Middle East > Jordan (0.04)
- Europe
- Netherlands > South Holland
- Delft (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > South Holland
- North America > United States
- California > Los Angeles County
- Los Angeles (0.14)
- Illinois (0.04)
- California > Los Angeles County
- Asia
- Genre:
- Research Report (0.68)
- Technology: