Bipartite Ranking From Multiple Labels: On Loss Versus Label Aggregation

Lukasik, Michal, Chen, Lin, Narasimhan, Harikrishna, Menon, Aditya Krishna, Jitkrittum, Wittawat, Yu, Felix X., Reddi, Sashank J., Fu, Gang, Bateni, Mohammadhossein, Kumar, Sanjiv

Apr-15-2025–arXiv.org Machine Learning

Bipartite ranking is a fundamental supervised learning problem, with the goal of learning a ranking over instances with maximal area under the ROC curve (AUC) against a single binary target label. However, one may often observe multiple binary target labels, e.g., from distinct human annotators. How can one synthesize such labels into a single coherent ranking? In this work, we formally analyze two approaches to this problem -- loss aggregation and label aggregation -- by characterizing their Bayes-optimal solutions. Based on this, we show that while both methods can yield Pareto-optimal solutions, loss aggregation can exhibit label dictatorship: one can inadvertently (and undesirably) favor one label over others. This suggests that label aggregation can be preferable to loss aggregation, which we empirically verify.

aggregation, artificial intelligence, machine learning, (19 more...)

arXiv.org Machine Learning

Apr-15-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Wisconsin > Dane County
    - Madison (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Massachusetts
    - Suffolk County > Boston (0.04)
    - Middlesex County > Cambridge (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
- Europe
  - Italy (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)

Genre:
- Research Report (0.40)

Industry:
- Education (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.55)
  - Machine Learning
    - Performance Analysis > Accuracy (0.48)
    - Inductive Learning (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found