Optimal Learning from Label Proportions with General Loss Functions
Lorne Applebaum, Travis Dick, Claudio Gentile, Haim Kaplan, Tomer Koren
arXiv.org Artificial Intelligence
Motivated by problems in online advertising, we address the task of Learning from Label Proportions (LLP). In this partially supervised setting, training data consists of groups of examples, termed bags, for which we observe only the average label value. The main goal, however, remains the design of a predictor for the labels of individual examples. We introduce a novel and versatile low-variance de-biasing methodology to learn from aggregate label information, significantly advancing the state of the art in LLP. Our approach is flexible, accommodating a broad spectrum of practically relevant loss functions across both binary and multi-class classification settings. By carefully combining our estimators with standard techniques, we substantially improve sample complexity guarantees for a large class of losses of practical relevance. We also validate our approach empirically on a diverse array of benchmark datasets, demonstrating advantages over standard baselines.
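To make the LLP setting concrete, the following is a minimal sketch of how bag-level supervision arises from individual labels. It illustrates only the data setting described above, not the de-biasing estimator of the paper; the function names `make_bags` and `label_proportions`, the toy labels, and the random bag assignment are all assumptions made for illustration.

```python
import random

def make_bags(num_examples, bag_size, seed=0):
    """Randomly partition example indices into bags (hypothetical helper)."""
    rng = random.Random(seed)
    indices = list(range(num_examples))
    rng.shuffle(indices)
    return [indices[i:i + bag_size] for i in range(0, num_examples, bag_size)]

def label_proportions(labels, bags):
    """Return the average label of each bag; this is all the learner observes."""
    return [sum(labels[i] for i in bag) / len(bag) for bag in bags]

# Toy binary labels; in LLP these are hidden from the learner.
labels = [1, 0, 0, 1, 1, 0, 1, 0]
bags = make_bags(len(labels), bag_size=4)
proportions = label_proportions(labels, bags)

# The learner receives the features of every example plus one number per bag.
for bag, p in zip(bags, proportions):
    print(f"bag {bag}: average label {p:.2f}")
```

The individual labels are used only to compute the per-bag averages; a predictor for individual examples must then be trained from the features and these averages alone.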
Sep-19-2025
- Country:
  - North America (0.15)
- Genre:
  - Research Report (0.81)
- Industry:
  - Information Technology > Services (0.48)
- Technology: