Do ImageNet Classifiers Generalize to ImageNet?

Recht, Benjamin, Roelofs, Rebecca, Schmidt, Ludwig, Shankar, Vaishaal

Feb-13-2019–arXiv.org Machine Learning

We build new test sets for the CIFAR-10 and ImageNet datasets. Both benchmarks have been the focus of intense research for almost a decade, raising the danger of overfitting to excessively re-used test sets. By closely following the original dataset creation processes, we test to what extent current classification models generalize to new data. We evaluate a broad range of models and find accuracy drops of 3% - 15% on CIFAR-10 and 11% - 14% on ImageNet. However, accuracy gains on the original test sets translate to larger gains on the new test sets. Our results suggest that the accuracy drops are not caused by adaptivity, but by the models' inability to generalize to slightly "harder" images than those found in the original test sets.

accuracy, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

Feb-13-2019

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand
  - South Island > Marlborough District > Blenheim (0.04)
- North America > United States
  - Tennessee (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Leisure & Entertainment (0.67)
- Government > Military (0.67)
- Aerospace & Defense (0.67)
- Transportation
  - Passenger (1.00)
  - Marine (1.00)
  - Ground > Road (1.00)
  - Freight & Logistics Services > Shipping (0.92)
  - Air (0.67)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Information Management (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found