Policy Regularization with Dataset Constraint for Offline Reinforcement Learning