AP-VLM: Active Perception Enabled by Vision-Language Models
