DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection
Aljaafari, Tala, Kanade, Varun, Torr, Philip, de Witt, Christian Schroeder
–arXiv.org Artificial Intelligence
Deploying reinforcement learning (RL) in safety-critical settings is constrained by brittleness under distribution shift. We study out-of-distribution (OOD) detection for RL time series and introduce DEEDEE, a two-statistic detector that revisits representation-heavy pipelines with a minimal alternative. DEEDEE uses only an episodewise mean and an RBF kernel similarity to a training summary, capturing complementary global and local deviations. Despite its simplicity, DEEDEE matches or surpasses contemporary detectors across standard RL OOD suites, delivering a 600-fold reduction in compute (FLOPs / wall-time) and an average 5% absolute accuracy gain over strong baselines. Conceptually, our results indicate that diverse anomaly types often imprint on RL trajectories through a small set of low-order statistics, suggesting a compact foundation for OOD detection in complex environments.
arXiv.org Artificial Intelligence
Oct-27-2025
- Country:
- Asia
- Japan > Honshū
- Chūgoku > Shimane Prefecture > Matsue (0.04)
- Middle East > Jordan (0.04)
- Japan > Honshū
- Europe > United Kingdom
- England
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.28)
- England
- Asia
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Information Technology (0.46)
- Technology: