Reinforcement Learning for Collision-free Flight Exploiting Deep Collision Encoding
Kulkarni, Mihir, Alexis, Kostas
–arXiv.org Artificial Intelligence
Abstract-- This work contributes a novel deep navigation policy that enables collision-free flight of aerial robots based on a modular approach exploiting deep collision encoding and reinforcement learning. The proposed solution builds upon a deep collision encoder that is trained on both simulated and real depth images using supervised learning such that it compresses the high-dimensional depth data to a low-dimensional latent space encoding collision information while accounting for the robot size. This compressed encoding is combined with an estimate of the robot's odometry and the desired target location to train a deep reinforcement learning navigation policy that offers low-latency computation and robust sim2real performance. A set of simulation and experimental studies in diverse environments are conducted and demonstrate the efficiency of the emerged behavior and its resilience in real-life deployments. Key to enabling resilient compressing them to a very low-dimensional latent space autonomy is identifying core functionalities that experience that retains information for collision building upon the significant impediments in their performance and designing principles of Variational Autoencoders (VAEs). In particular, novel approaches to overcome such limitations.
arXiv.org Artificial Intelligence
Feb-6-2024