Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

Neural Information Processing Systems 

Operating in the real-world often requires agents to learn about a complex environment and apply this understanding to achieve a breadth of goals.