Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes

Open in new window