Partial success in closing the gap between human and machine vision
–Neural Information Processing Systems
A few years ago, the first CNN surpassed human performance on ImageNet. However, it soon became clear that machines lack robustness on more challenging test cases, a major obstacle towards deploying machines in the wild and towards obtaining better computational models of human visual perception. Here we ask: Are we making progress in closing the gap between human and machine vision? To answer this question, we tested human observers on a broad range of out-of-distribution (OOD) datasets, recording 85,120 psychophysical trials across 90 participants. We then investigated a range of promising machine learning developments that crucially deviate from standard supervised CNNs along three axes: objective function (self-supervised, adversarially trained, CLIP language-image training), architecture (e.g.
Neural Information Processing Systems
Dec-24-2025, 21:48:40 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.55)
- Vision (0.43)
- Information Technology > Artificial Intelligence