Visual Interaction Networks: Learning a Physics Simulator from Video