Preference-Guided Reinforcement Learning for Efficient Exploration