World ModelBench: Judging Video Generation Models As World Models
–Neural Information Processing Systems
Video generation models have rapidly progressed, positioning themselves as video world models capable of supporting decision-making applications like robotics and autonomous driving. However, current benchmarks fail to rigorously evaluate these claims, focusing only on general video quality, ignoring important factors to world models such as physics adherence. To bridge this gap, we propose WorldModelBench, a benchmark designed to evaluate the world modeling capabilities of video generation models in application-driven domains. WorldModelBench offers two key advantages: (1) Against to nuanced world modeling violations: By incorporating instruction-following and physics-adherence dimensions, WorldModelBench detects subtle violations, such as irregular changes in object size that breach the mass conservation law--issues overlooked by prior benchmarks.
Neural Information Processing Systems
Jun-17-2026, 05:16:36 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology (0.66)
- Technology:
- Information Technology > Artificial Intelligence
- Robots (1.00)
- Representation & Reasoning (1.00)
- Cognitive Science > Problem Solving (0.92)
- Natural Language
- Large Language Model (1.00)
- Chatbot (0.68)
- Machine Learning > Neural Networks
- Deep Learning (0.46)
- Information Technology > Artificial Intelligence