Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks