Datasets for Studying Generalization from Easy to Hard Examples

Schwarzschild, Avi, Borgnia, Eitan, Gupta, Arjun, Bansal, Arpit, Emam, Zeyad, Huang, Furong, Goldblum, Micah, Goldstein, Tom

arXiv.org Artificial Intelligence 

In domains like computer vision, single and multi-agent games, and mathematical reasoning, classically trained models perform well on inputs from the same distribution used for training, but often fail to extrapolate their knowledge to more difficult tasks sampled from a different (but related) distribution. The goal of approaches like deep thinking and algorithm learning is to construct systems that achieve this extrapolation. With this in mind, we detail several datasets intended to motivate and facilitate novel research into systems that generalize from easy training data to harder test examples.