Bipartite Ranking From Multiple Labels: On Loss Versus Label Aggregation
Lukasik, Michal, Chen, Lin, Narasimhan, Harikrishna, Menon, Aditya Krishna, Jitkrittum, Wittawat, Yu, Felix X., Reddi, Sashank J., Fu, Gang, Bateni, Mohammadhossein, Kumar, Sanjiv
Bipartite ranking is a fundamental supervised learning problem, with the goal of learning a ranking over instances with maximal area under the ROC curve (AUC) against a single binary target label. However, one may often observe multiple binary target labels, e.g., from distinct human annotators. How can one synthesize such labels into a single coherent ranking? In this work, we formally analyze two approaches to this problem -- loss aggregation and label aggregation -- by characterizing their Bayes-optimal solutions. Based on this, we show that while both methods can yield Pareto-optimal solutions, loss aggregation can exhibit label dictatorship: one can inadvertently (and undesirably) favor one label over others. This suggests that label aggregation can be preferable to loss aggregation, which we empirically verify.
Apr-15-2025
- Country:
- Europe
- Italy (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- North America > United States
- Georgia > Fulton County
- Atlanta (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Georgia > Fulton County
- Europe
- Genre:
- Research Report (0.40)
- Industry:
- Education (0.34)
- Technology: