Reinforcement Learning for UAV control with Policy and Reward Shaping