Efficient Reinforcemen Learning via Decoupling Exploration and Utilization