Policy Continuation with Hindsight Inverse Dynamics

Hao Sun, Zhizhong Li, Xiaotong Liu, Bolei Zhou, Dahua Lin

Neural Information Processing Systems 

Solving goal-oriented tasks is an important but challenging problem in reinforcement learning (RL).