Beyond Single-Step Updates: Reinforcement Learning of Heuristics with Limited-Horizon Search
