Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning