Multi-Group Fairness Evaluation via Conditional Value-at-Risk Testing