Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards

Max Planck Institute for Software Systems (MPI-SWS), Saarbrücken, Germany
