Boosting Offline Reinforcement Learning with Action Preference Query