Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards