Belief Reward Shaping in Reinforcement Learning