Bipartite Ranking From Multiple Labels: On Loss Versus Label Aggregation

Lukasik, Michal, Chen, Lin, Narasimhan, Harikrishna, Menon, Aditya Krishna, Jitkrittum, Wittawat, Yu, Felix X., Reddi, Sashank J., Fu, Gang, Bateni, Mohammadhossein, Kumar, Sanjiv

arXiv.org Machine Learning 

Bipartite ranking is a fundamental supervised learning problem, with the goal of learning a ranking over instances with maximal area under the ROC curve (AUC) against a single binary target label. However, one may often observe multiple binary target labels, e.g., from distinct human annotators. How can one synthesize such labels into a single coherent ranking? In this work, we formally analyze two approaches to this problem -- loss aggregation and label aggregation -- by characterizing their Bayes-optimal solutions. Based on this, we show that while both methods can yield Pareto-optimal solutions, loss aggregation can exhibit label dictatorship: one can inadvertently (and undesirably) favor one label over others. This suggests that label aggregation can be preferable to loss aggregation, which we empirically verify.