RLVR-World: Training World Models with Reinforcement Learning

Neural Information Processing Systems

World models predict state transitions in response to actions and are increasingly developed across diverse modalities.