Video Prediction Models as Rewards for Reinforcement Learning

Neural Information Processing Systems 

Specifying reward signals that allow agents to learn complex behaviors is a longstanding challenge in reinforcement learning.