Pre-Trained Video Generative Models as World Simulators