Video Prediction Models as Rewards for Reinforcement Learning Alejandro Escontrela Ademi Adeniji Wilson Y an