Evaluating model performance under worst-case subpopulations
–Neural Information Processing Systems
The training population typically does not accurately represent what the model will encounter under operation.
Feb-9-2026, 21:46:59 GMT