Bridging the Gap Between Multimodal Foundation Models and World Models

Open in new window