Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models
