Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI

Raonić, Bogdan, Mishra, Siddhartha, Lanthaler, Samuel

Sep-30-2025–arXiv.org Artificial Intelligence

Data-driven models are increasingly adopted in critical scientific fields like weather forecasting and fluid dynamics. These methods can fail on out-of-distribution (OOD) data, but detecting such failures in regression tasks is an open challenge. We propose a new OOD detection method based on estimating joint likelihoods using a score-based diffusion model. This approach considers not just the input but also the regression model's prediction, providing a task-aware reliability score. Across numerous scientific datasets, including PDE datasets, satellite imagery and brain tumor segmentation, we show that this likelihood strongly correlates with prediction error. Our work provides a foundational step towards building a verifiable 'certificate of trust', thereby offering a practical tool for assessing the trustworthiness of AI-based scientific predictions. Our code is publicly available at https://github.com/bogdanraonic3/OOD_Detection_ScientificML

artificial intelligence, machine learning, task-aware ood detection, (15 more...)

arXiv.org Artificial Intelligence

Sep-30-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report > New Finding (0.67)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (0.68)
  - Health Care Technology (0.46)
  - Therapeutic Area (0.46)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.93)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Vision (0.93)
    - Machine Learning
      - Statistical Learning (1.00)
      - Performance Analysis > Accuracy (0.68)
      - Neural Networks > Deep Learning (0.46)