Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark

Neural Information Processing Systems 

We propose new width-based planning and learning algorithms inspired from a careful analysis of the design decisions made by previous width-based planners.