Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment

Salgado, Henry, Kendall, Meagan R., Ceberio, Martine

Dec-9-2025–arXiv.org Artificial Intelligence

In this work, we propose a simple and computationally efficient framework for evaluating whether machine learning models align with the structure of the data they learn from; that is, whether the model says what the data says. Unlike existing interpretability methods that focus exclusively on explaining model behavior, our approach establishes a baseline derived directly from the data itself. Drawing inspiration from Rubin's Potential Outcomes Framework, we quantify how strongly each feature separates the two outcome groups in a binary classification task, moving beyond traditional descriptive statistics to estimate each feature's effect on the outcome. By comparing these data-derived feature rankings with model-based explanations, we provide practitioners with an interpretable and model-agnostic method for assessing model-data alignment.

alignment, artificial intelligence, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Dec-9-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Texas (0.15)
- Europe > Austria
  - Vienna (0.14)

Genre:
- Research Report (0.64)

Industry:
- Health & Medicine
  - Therapeutic Area (0.98)
  - Diagnostic Medicine (0.69)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found