Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark

Open in new window