What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models

Open in new window