Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI
Raonić, Bogdan, Mishra, Siddhartha, Lanthaler, Samuel
–arXiv.org Artificial Intelligence
Data-driven models are increasingly adopted in critical scientific fields like weather forecasting and fluid dynamics. These methods can fail on out-of-distribution (OOD) data, but detecting such failures in regression tasks is an open challenge. We propose a new OOD detection method based on estimating joint likelihoods using a score-based diffusion model. This approach considers not just the input but also the regression model's prediction, providing a task-aware reliability score. Across numerous scientific datasets, including PDE datasets, satellite imagery and brain tumor segmentation, we show that this likelihood strongly correlates with prediction error. Our work provides a foundational step towards building a verifiable 'certificate of trust', thereby offering a practical tool for assessing the trustworthiness of AI-based scientific predictions. Our code is publicly available at https://github.com/bogdanraonic3/OOD_Detection_ScientificML
arXiv.org Artificial Intelligence
Sep-30-2025
- Country:
- Africa (0.04)
- Asia (0.14)
- Europe > Switzerland
- North America > United States
- Maryland > Prince George's County > Greenbelt (0.04)
- Oceania > Australia (0.04)
- South America (0.04)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (0.68)
- Health Care Technology (0.46)
- Therapeutic Area (0.46)
- Health & Medicine
- Technology: