Generalization by Recognizing Confusion
Chiu, Daniel, Wang, Franklyn, Kominers, Scott Duke
A recently-proposed technique called self-adaptive training augments modern neural networks by allowing them to adjust training labels on the fly, to avoid overfitting to samples that may be mislabeled or otherwise non-representative. By combining the self-adaptive objective with mixup, we further improve the accuracy of self-adaptive models for image recognition; the resulting classifier obtains state-of-the-art accuracies on datasets corrupted with label noise. Robustness to label noise implies a lower generalization gap; thus, our approach also leads to improved generalizability. We find evidence that the Rademacher complexity of these algorithms is low, suggesting a new path towards provable generalization for this type of deep learning model. Last, we highlight a novel connection between difficulties accounting for rare classes and robustness under noise, as rare classes are in a sense indistinguishable from label noise. Our code can be found at https://github.com/Tuxianeer/generalizationconfusion.
Jun-13-2020
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > Los Angeles County
- Long Beach (0.14)
- Utah > Salt Lake County
- Canada > British Columbia
- United States
- Europe
- France (0.04)
- United Kingdom > England
- North Yorkshire > York (0.04)
- Cambridgeshire > Cambridge (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Asia > South Korea
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Technology: