Subgoal-Guided Policy Heuristic Search with Learned Subgoals