Dual Behavior Regularized Reinforcement Learning