Diagnostic Tool for Out-of-Sample Model Evaluation
Hult, Ludvig; Zachariah, Dave; Stoica, Petre
Assessment of model fitness is a key part of machine learning. The standard paradigm of model evaluation is analysis of the average loss over future data. This is often explicit in model fitting, where we select models that minimize the average loss over training data as a surrogate, but it comes with limited theoretical guarantees. In this paper, we consider the problem of characterizing a batch of out-of-sample losses of a model using a calibration data set. We provide finite-sample limits on the out-of-sample losses that are statistically valid under quite general conditions and propose a diagnostic tool that is simple to compute and interpret. Several numerical experiments show how the proposed method quantifies the impact of distribution shifts, aids the analysis of regression, and enables model selection as well as hyperparameter tuning.
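The core idea is to bound future losses of a fitted model using losses computed on a held-out calibration set. As an illustration only, the sketch below uses a standard distribution-free order-statistic (conformal-style) quantile of the calibration losses as an upper bound for a single new loss; the function name `loss_upper_bound` and the placeholder calibration data are assumptions, and the paper's actual diagnostic for a batch of out-of-sample losses may differ.

```python
# Minimal sketch: calibration-based upper bound on a future loss.
# Assumption: illustrated via a distribution-free order-statistic (conformal-style)
# quantile of calibration losses; this is not claimed to be the paper's exact method.
import numpy as np

def loss_upper_bound(calibration_losses, alpha=0.1):
    """Return a level-(1 - alpha) upper bound for a new loss.

    Valid for exchangeable losses by the usual order-statistic argument:
    P(new loss <= k-th smallest calibration loss) >= k / (n + 1).
    """
    losses = np.sort(np.asarray(calibration_losses, dtype=float))
    n = losses.size
    # Smallest rank k with k / (n + 1) >= 1 - alpha.
    k = int(np.ceil((1 - alpha) * (n + 1)))
    if k > n:
        return np.inf  # too few calibration points for this alpha
    return losses[k - 1]

# Usage with placeholder calibration losses (hypothetical data).
rng = np.random.default_rng(0)
cal_losses = rng.exponential(scale=1.0, size=200)
print(loss_upper_bound(cal_losses, alpha=0.1))
```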
Oct-16-2023
- Country:
- North America
- Canada > Quebec (0.14)
- United States > New York (0.14)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (0.88)