What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

Open in new window