Trust, or Don't Predict: Introducing the CWSA Family for Confidence-Aware Model Evaluation