Generalization is not a universal guarantee: Estimating similarity to training data with an ensemble out-of-distribution metric
Schreyer, W. Max, Anderson, Christopher, Thompson, Reid F.
–arXiv.org Artificial Intelligence
Failure of machine learning models to generalize to new data is a core problem limiting the reliability of AI systems, partly due to the lack of simple and robust methods for comparing new data to the original training dataset. We propose a standardized approach for assessing data similarity in a model-agnostic manner by constructing a supervised autoencoder for generalizability estimation (SAGE). We compare points in a low-dimensional embedded latent space, defining empirical probability measures for k -Nearest Neighbors (kNN) distance, reconstruction of inputs and task-based performance. As proof of concept for classification tasks, we use MNIST and CIFAR-10 to demonstrate how an ensemble output probability score can separate deformed images from a mixture of typical test examples, and how this SAGE score is robust to transformations of increasing severity. As further proof of concept, we extend this approach to a regression task using non-imaging data (UCI Abalone). In all cases, we show that out-of-the-box model performance increases after SAGE score filtering, even when applied to data from the model's own training and test datasets. Our out-of-distribution scoring method can be introduced during several steps of model construction and assessment, leading to future improvements in responsible deep learning implementation. 1 Background The presence of generalization gaps, where machine learning performance degrades when a trained model encounters previously-unseen data, represents a critical ongoing challenge in the implementation of AI systems.
arXiv.org Artificial Intelligence
Feb-25-2025
- Country:
- Indian Ocean > Bass Strait (0.04)
- Oceania > Australia
- Tasmania (0.04)
- North America > United States
- Oregon > Multnomah County
- Portland (0.05)
- Massachusetts > Suffolk County
- Boston (0.04)
- Oregon > Multnomah County
- Genre:
- Research Report (0.82)
- Industry:
- Transportation (1.00)
- Health & Medicine (1.00)
- Government
- Technology: