MIT researchers find 'systematic' shortcomings in ImageNet data set

Jul-15-2020, 16:45:31 GMT–#artificialintelligence

MIT researchers have concluded that the well-known ImageNet data set has "systematic annotation issues" and, when used as a benchmark, is misaligned with ground truth or direct observation. "Our analysis pinpoints how a noisy data collection pipeline can lead to a systematic misalignment between the resulting benchmark and the real-world task it serves as a proxy for," the researchers write in a paper titled "From ImageNet to Image Classification: Contextualizing Progress on Benchmarks." "We believe that developing annotation pipelines that better capture the ground truth while remaining scalable is an important avenue for future research."

When the Stanford University Vision Lab introduced ImageNet at the Conference on Computer Vision and Pattern Recognition (CVPR) in 2009, it was much larger than many previously existing image data sets. The data set contains millions of photos and was assembled over more than two years. ImageNet uses the WordNet hierarchy for data labels and is widely used as a benchmark for object recognition models.
