Routing Recovery for UAV Networks with Deliberate Attacks: A Reinforcement Learning based Approach