Vision-Language Models as Success Detectors

Open in new window