FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation

Jun-12-2026, 20:56:51 GMT–Neural Information Processing Systems

Text-to-video diffusion models are notoriously limited in their ability to model temporal aspects such as motion, physics, and dynamic interactions. Existing approaches address this limitation by retraining the model or introducing external conditioning signals to enforce temporal consistency. In this work, we explore whether a meaningful temporal representation can be extracted directly from the predictions of a pre-trained model without any additional training or auxiliary inputs.

artificial intelligence, machine learning, proceedings, (7 more...)

Neural Information Processing Systems

Jun-12-2026, 20:56:51 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.43)