Unsupervised Learning for Physical Interaction through Video Prediction

Chelsea Finn, Ian Goodfellow, Sergey Levine

Neural Information Processing Systems 

A core challenge for an agent learning to interact with the world is to predict how its actions affect objects in its environment.