OvercomingCommonFlawsintheEvaluationof SelectiveClassificationSystems
–Neural Information Processing Systems
Whilecurrentevaluationofthese systems typically assumes fixed working points based on pre-defined rejection thresholds, methodological progress requires benchmarking the general performance of systems akin to the AUROC in standard classification. In this work, we define 5 requirements for multi-threshold metrics in selective classification regarding task alignment, interpretability, and flexibility, and show how current approaches fail to meet them.
Neural Information Processing Systems
Feb-7-2026, 07:09:07 GMT
- Country:
- Europe
- Germany (0.05)
- Spain > Andalusia
- Granada Province > Granada (0.04)
- North America > United States (0.04)
- Europe
- Genre:
- Research Report > Experimental Study (0.46)
- Industry:
- Health & Medicine
- Diagnostic Medicine (0.46)
- Therapeutic Area (0.46)
- Health & Medicine
- Technology: