VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs

Open in new window