Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards

Apr-25-2026, 03:48:35 GMT–Neural Information Processing Systems

We study the problem of reward shaping to accelerate the training process of a reinforcement learning agent. Existing works have considered a number of different reward shaping formulations; however, they either require external domain knowledge or fail in environments with extremely sparse rewards. In this paper, we propose a novel framework, Exploration-Guided Reward Shaping (EXPLORS), that operates in a fully self-supervised manner and can accelerate an agent's learning even in sparse-reward environments. The key idea of EXPLORS is to learn an intrinsic reward function in combination with exploration-based bonuses to maximize the agent's utility w.r.t.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Apr-25-2026, 03:48:35 GMT

Conferences PDF

Add feedback

Country:
- Europe (0.46)

Genre:
- Research Report > New Finding (0.93)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
266c0f191b04cbbbe529016d0edc847e-Supplemental-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found