Jayakumar, Paramsothy
ConvBKI: Real-Time Probabilistic Semantic Mapping Network with Quantifiable Uncertainty
Wilson, Joey, Fu, Yuewei, Friesen, Joshua, Ewen, Parker, Capodieci, Andrew, Jayakumar, Paramsothy, Barton, Kira, Ghaffari, Maani
In this paper, we develop a modular neural network for real-time semantic mapping in uncertain environments, which explicitly updates per-voxel probabilistic distributions within a neural network layer. Our approach combines the reliability of classical probabilistic algorithms with the performance and efficiency of modern neural networks. Although robotic perception is often divided between modern differentiable methods and classical explicit methods, a union of both is necessary for real-time and trustworthy performance. We introduce a novel Convolutional Bayesian Kernel Inference (ConvBKI) layer which incorporates semantic segmentation predictions online into a 3D map through a depthwise convolution layer by leveraging conjugate priors. We compare ConvBKI against state-of-the-art deep learning approaches and probabilistic algorithms for mapping to evaluate reliability and performance. We also create a Robot Operating System (ROS) package of ConvBKI and test it on real-world perceptually challenging off-road driving data.
Convolutional Bayesian Kernel Inference for 3D Semantic Mapping
Wilson, Joey, Fu, Yuewei, Zhang, Arthur, Song, Jingyu, Capodieci, Andrew, Jayakumar, Paramsothy, Barton, Kira, Ghaffari, Maani
Abstract-- Robotic perception is currently at a cross-roads between modern methods, which operate in an efficient latent space, and classical methods, which are mathematically founded and provide interpretable, trustworthy results. In this paper, we introduce a Convolutional Bayesian Kernel Inference (ConvBKI) layer which learns to perform explicit Bayesian inference within a depthwise separable convolution layer to maximize efficency while maintaining reliability simultaneously. We apply our layer to the task of real-time 3D semantic mapping, where we learn semantic-geometric probability distributions for Li-DAR sensor information and incorporate semantic predictions into a global map. The constructed semantic volumes are convolved with a depthwise filter to perform a real-time Bayesian update on a semantic 3D map. Robust world models are essential for safe and reliable the structured geometric representations of earlier, probabilistic autonomous robots.