Effective Reinforcement Learning Control using Conservative Soft Actor-Critic