Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus
