Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
Theile, Mirco, Bayerlein, Harald, Caccamo, Marco, Sangiovanni-Vincentelli, Alberto L.
–arXiv.org Artificial Intelligence
Coverage path planning (CPP) is a critical problem in robotics, where the goal is to find an efficient path that covers every point in an area of interest. This work addresses the power-constrained CPP problem with recharge for battery-limited unmanned aerial vehicles (UAVs). In this problem, a notable challenge emerges from integrating recharge journeys into the overall coverage strategy, highlighting the intricate task of making strategic, long-term decisions. We propose a novel proximal policy optimization (PPO)-based deep reinforcement learning (DRL) approach with map-based observations, utilizing action masking and discount factor scheduling to optimize coverage trajectories over the entire mission horizon. We further provide the agent with a position history to handle emergent state loops caused by the recharge capability. Our approach outperforms a baseline heuristic, generalizes to different target zones and maps, with limited generalization to unseen maps. We offer valuable insights into DRL algorithm design for long-horizon problems and provide a publicly available software framework for the CPP problem.
arXiv.org Artificial Intelligence
Sep-7-2023
- Country:
- Europe > Germany
- Bavaria > Upper Bavaria > Munich (0.04)
- North America > United States
- California > Alameda County > Berkeley (0.04)
- Europe > Germany
- Genre:
- Research Report (0.82)
- Industry:
- Aerospace & Defense > Aircraft (0.34)
- Information Technology > Robotics & Automation (0.48)
- Transportation (1.00)
- Technology: