Reinforcement Learning for Agile Active Target Sensing with a UAV