Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs