Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning
