Latent Uncertainty Representations for Video-based Driver Action and Intention Recognition
Vellenga, Koen, Steinhauer, H. Joe, Andersson, Jonas, Sjögren, Anders
–arXiv.org Artificial Intelligence
Deep neural networks (DNNs) are increasingly applied to safety-critical tasks in resource-constrained environments, such as video-based driver action and intention recognition. While last layer probabilistic deep learning (LL-PDL) methods can detect out-of-distribution (OOD) instances, their performance varies. As an alternative to last layer approaches, we propose extending pre-trained DNNs with transformation layers to produce multiple latent representations to estimate the uncertainty. W e evaluate our latent uncertainty representation (LUR) and repulsively trained LUR (RLUR) approaches against eight PDL methods across four video-based driver action and intention recognition datasets, comparing classification performance, calibration, and uncertainty-based OOD detection. W e also contribute 28,000 frame-level action labels and 1,194 video-level intention labels for the NuScenes dataset. Our results show that LUR and RLUR achieve comparable in-distribution classification performance to other LL-PDL approaches. F or uncertainty-based OOD detection, LUR matches top-performing PDL methods while being more efficient to train and easier to tune than approaches that require Markov-Chain Monte Carlo sampling or repulsive training procedures.
arXiv.org Artificial Intelligence
Oct-7-2025
- Country:
- North America > United States (0.67)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Government (0.92)
- Technology: