Generalizations across filler-gap dependencies in neural language models
Howitt, Katherine, Nair, Sathvik, Dods, Allison, Hopkins, Robert Melvin
–arXiv.org Artificial Intelligence
Humans develop their grammars by making structural generalizations from finite input. We ask how filler-gap dependencies, which share a structural generalization despite diverse surface forms, might arise from the input. We explicitly control the input to a neural language model (NLM) to uncover whether the model posits a shared representation for filler-gap dependencies. We show that while NLMs do have success differentiating grammatical from ungrammatical filler-gap dependencies, they rely on superficial properties of the input, rather than on a shared generalization. Our work highlights the need for specific linguistic inductive biases to model language acquisition.
arXiv.org Artificial Intelligence
Oct-23-2024
- Country:
- North America
- United States
- Maryland (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Hampshire County > Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- Santa Clara County > Stanford (0.04)
- Orange County > Irvine (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- United Kingdom > England
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.69)
- Research Report
- Technology: