WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation
Neural Information Processing Systems
Recent advances in text-to-video (T2V) generation, exemplified by models such as Sora and Kling, have demonstrated strong potential for constructing world simulators. However, existing T2V models still struggle to understand abstract physical principles and to generate videos that faithfully obey physical laws.
Jun-14-2026, 12:36:01 GMT
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Media (0.69)
- Information Technology (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Machine Learning > Neural Networks (1.00)
- Representation & Reasoning (0.93)
- Natural Language > Large Language Model (0.69)