PAC Learning Linear Thresholds from Label Proportions
–Neural Information Processing Systems
Learning from label proportions (LLP) is a generalization of supervised learning in which the training data is available as sets or bags of feature-vectors (instances) along with the average instance-label of each bag. The goal is to train a good instance classifier. While most previous works on LLP have focused on training models on such training data, computational learnability of LLP was only recently explored by [25, 26] who showed worst case intractability of properly learning linear threshold functions (LTFs) from label proportions. However, their work did not rule out efficient algorithms for this problem on natural distributions. In this work we show that it is indeed possible to efficiently learn LTFs using LTFs when given access to random bags of some label proportion in which featurevectors are, conditioned on their labels, independently sampled from a Gaussian distribution N(µ, Σ).
Neural Information Processing Systems
Feb-11-2025, 12:39:02 GMT