Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Su Young Lee, Choi Sungik, Sae-Young Chung

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/