Classification Under Misspecification: Halfspaces, Generalized Linear Models, and Evolvability
–Neural Information Processing Systems
In this paper, we revisit the problem of distribution-independently learning halfspaces under Massart noise with rate \eta . Recent work resolved a long-standing problem in this model of efficiently learning to error \eta \epsilon for any \epsilon 0, by giving an improper learner that partitions space into \text{poly}(d,1/\epsilon) regions. Here we give a much simpler algorithm and settle a number of outstanding open questions: (1) We give the first \emph{proper} learner for Massart halfspaces that achieves \eta \epsilon . We then zoom out to study generalized linear models and give an efficient algorithm for learning under a challenging new corruption model generalizing Massart noise. Finally we study our algorithm for learning halfspaces under Massart noise empirically and find that it exhibits some appealing fairness properties as a byproduct of its strong provable robustness guarantees.
Neural Information Processing Systems
Oct-10-2024, 08:00:24 GMT
- Technology: