What's in Common? Multimodal Models Hallucinate When Reasoning Across Scenes
