Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles

Lin, Xi, Szenher, Paul, Huang, Yewei, Englot, Brendan

arXiv.org Artificial Intelligence 

Abstract--With the growing demands for Autonomous Surface Vehicles (ASVs) in recent years, the number of ASVs being deployed for various maritime missions is expected to increase rapidly in the near future. However, it is still challenging for ASVs to perform sensor-based autonomous navigation in obstacle-filled and congested waterways, where perception errors, closely gathered vehicles and limited maneuvering space near buoys may cause difficulties in following the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). To address these issues, we propose a novel Distributional Reinforcement Learning based navigation system that can work with onboard LiDAR and odometry sensors to generate arbitrary thrust commands in continuous action space. Comprehensive evaluations of the proposed system in highfidelity Gazebo simulations show its ability to decide whether to follow COLREGs or take other beneficial actions based on the scenarios encountered, offering superior performance in navigation safety and efficiency compared to systems using stateof-the-art Distributional RL, non-Distributional RL and classical methods. Figure 1: The proposed navigation system.