Generalization by Recognizing Confusion

Chiu, Daniel, Wang, Franklyn, Kominers, Scott Duke

Jun-13-2020–arXiv.org Machine Learning

A recently-proposed technique called self-adaptive training augments modern neural networks by allowing them to adjust training labels on the fly, to avoid overfitting to samples that may be mislabeled or otherwise non-representative. By combining the self-adaptive objective with mixup, we further improve the accuracy of self-adaptive models for image recognition; the resulting classifier obtains state-of-the-art accuracies on datasets corrupted with label noise. Robustness to label noise implies a lower generalization gap; thus, our approach also leads to improved generalizability. We find evidence that the Rademacher complexity of these algorithms is low, suggesting a new path towards provable generalization for this type of deep learning model. Last, we highlight a novel connection between difficulties accounting for rare classes and robustness under noise, as rare classes are in a sense indistinguishable from label noise. Our code can be found at https://github.com/Tuxianeer/generalizationconfusion.

artificial intelligence, machine learning, self-adaptive training, (19 more...)

arXiv.org Machine Learning

Jun-13-2020

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
    - Massachusetts
      - Middlesex County > Cambridge (0.04)
      - Suffolk County > Boston (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California > Los Angeles County
      - Long Beach (0.14)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.05)
- Europe
  - France (0.04)
  - United Kingdom > England
    - North Yorkshire > York (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)
- Asia > South Korea
  - Seoul > Seoul (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found