A Reinforcement Learning Approach for Robust Supervisory Control of UAVs Under Disturbances