Fast deep reinforcement learning using online adjustments from the past

Steven Hansen, Alexander Pritzel, Pablo Sprechmann, Andre Barreto, Charles Blundell

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/