Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI
Raonić, Bogdan, Mishra, Siddhartha, Lanthaler, Samuel
–arXiv.org Artificial Intelligence
Data-driven models are increasingly adopted in critical scientific fields like weather forecasting and fluid dynamics. These methods can fail on out-of-distribution (OOD) data, but detecting such failures in regression tasks is an open challenge. We propose a new OOD detection method based on estimating joint likelihoods using a score-based diffusion model. This approach considers not just the input but also the regression model's prediction, providing a task-aware reliability score. Across numerous scientific datasets, including PDE datasets, satellite imagery and brain tumor segmentation, we show that this likelihood strongly correlates with prediction error. Our work provides a foundational step towards building a verifiable 'certificate of trust', thereby offering a practical tool for assessing the trustworthiness of AI-based scientific predictions. Our code is publicly available at https://github.com/bogdanraonic3/OOD_Detection_ScientificML
arXiv.org Artificial Intelligence
Sep-30-2025
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (0.68)
- Health Care Technology (0.46)
- Therapeutic Area (0.46)
- Health & Medicine
- Technology: