Provable Offline Preference-Based Reinforcement Learning