Preference Elicitation for Offline Reinforcement Learning