Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

Mar-20-2025, 20:04:55 GMT–Neural Information Processing Systems

We study the problem of agnostically learning homogeneous halfspaces in the distribution-specific PAC model. For a broad family of structured distributions, including log-concave distributions, we show that non-convex SGD efficiently converges to a solution with misclassification error O(opt) + ɛ, where opt is the misclassification error of the best-fitting halfspace. In sharp contrast, we show that optimizing any convex surrogate inherently leads to misclassification error of ω(opt), even under Gaussian marginals.

artificial intelligence, machine learning, optimization problem, (17 more...)

Neural Information Processing Systems

Mar-20-2025, 20:04:55 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.29)

Genre:
- Research Report > New Finding (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Computational Learning Theory (0.68)
    - Statistical Learning (0.93)
  - Representation & Reasoning > Optimization (0.68)