CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries