Efficient Video Generation on Complex Datasets