Can Pretrained Vision-Language Embeddings Alone Guide Robot Navigation?
