Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Neural Information Processing Systems 

We propose Episodic Backward Update (EBU) - a novel deep reinforcement learning algorithm with a direct value propagation.