Policy-regularized Offline Multi-objective Reinforcement Learning