An Improved Yaw Control Algorithm for Wind Turbines via Reinforcement Learning