While the ImageNet dataset has been driving computer vision research over the past decade, significant label noise and ambiguity have made top-1 accuracy an insufficient measure of further progress.
Toeasethelearning, wedemonstrate thatitisbeneficial toadoptacurriculum learning strategy [23], where harder negatives are introduced after an initial stage of learning on easiernegatives.
Our theoretical analysis reveals that in this setting, in-context learning is more about identifying the task than about learning it, a result which is in line with a series of recent empirical findings.