Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping Hao Sun 1 Lei Han 2 Rui Y ang

Neural Information Processing Systems 

In this work, we study the simple yet universally applicable case of reward shaping in value-based Deep Reinforcement Learning (DRL).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found