Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards Max Planck Institute for Software Systems (MPI-SWS), Saarbrucken, Germany
–Neural Information Processing Systems
We study the problem of reward shaping to accelerate the training process of a reinforcement learning agent. Existing works have considered a number of different reward shaping formulations; however, they either require external domain knowledge or fail in environments with extremely sparse rewards.
Neural Information Processing Systems
Jan-25-2025, 03:50:57 GMT
- Country:
- Europe > Germany > Saarland > Saarbrücken (0.40)
- Genre:
- Research Report > New Finding (0.93)
- Technology: