GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Open in new window