Ambiguous Images With Human Judgments for Robust Visual Event Classification

Oct-9-2024, 18:58:46 GMT–Neural Information Processing Systems

Contemporary vision benchmarks predominantly consider tasks on which humans can achieve near-perfect performance. However, humans are frequently presented with visual data that they cannot classify with 100% certainty, and models trained on standard vision benchmarks achieve low performance when evaluated on this data. To address this issue, we introduce a procedure for creating datasets of ambiguous images and use it to produce SQUID-E ("Squidy"), a collection of noisy images extracted from videos. All images are annotated with ground truth values and a test set is annotated with human uncertainty judgments. We use this dataset to characterize human uncertainty in vision tasks and evaluate existing visual event classification models.

ambiguous image, human judgment, robust visual event classification, (2 more...)

Neural Information Processing Systems

Oct-9-2024, 18:58:46 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.44)
  - Machine Learning (0.44)