Search on the Replay Buffer: Bridging Planning and Reinforcement Learning

Ben Eysenbach, Russ R. Salakhutdinov, Sergey Levine

Neural Information Processing Systems 