Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Open in new window