Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Open in new window