Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Open in new window