Exploration via Hindsight Goal Generation

Neural Information Processing Systems 

However, the sparsity of such reward definition makes traditional reinforcement learning algorithms very inefficient.