Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks
Lake, Brenden M., Baroni, Marco
–arXiv.org Artificial Intelligence
Humans can understand and produce new utterances effortlessly, thanks to their compositional skills. Once a person learns the meaning of a new verb "dax," he or she can immediately understand the meaning of "dax twice" or "sing and dax." In this paper, we introduce the SCAN domain, consisting of a set of simple compositional navigation commands paired with the corresponding action sequences. We then test the zero-shot generalization capabilities of a variety of recurrent neural networks (RNNs) trained on SCAN with sequence-to-sequence methods. We find that RNNs can make successful zero-shot generalizations when the differences between training and test commands are small, so that they can apply "mix-and-match" strategies to solve the task. However, when generalization requires systematic compositional skills (as in the "dax" example above), RNNs fail spectacularly. We conclude with a proof-of-concept experiment in neural machine translation, suggesting that lack of systematicity might be partially responsible for neural networks' notorious training data thirst.
arXiv.org Artificial Intelligence
Jun-6-2018
- Country:
- Europe
- Germany > Berlin (0.05)
- Italy > Veneto
- Venice (0.04)
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Dordrecht (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- California > San Diego County
- San Diego (0.04)
- Massachusetts > Middlesex County
- New York (0.04)
- California > San Diego County
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Europe
- Genre:
- Research Report > New Finding (0.93)
- Technology: