A Psycholinguistic Evaluation of Language Models' Sensitivity to Argument Roles
Lee, Eun-Kyoung Rosa, Nair, Sathvik, Feldman, Naomi
–arXiv.org Artificial Intelligence
We present a systematic evaluation of large language models' sensitivity to argument roles, i.e., who did what to whom, by replicating psycholinguistic studies on human argument role processing. In three experiments, we find that language models are able to distinguish verbs that appear in plausible and implausible contexts, where plausibility is determined through the relation between the verb and its preceding arguments. However, none of the models capture the same selective patterns that human comprehenders exhibit during real-time verb prediction. This indicates that language models' capacity to detect verb plausibility does not arise from the same mechanism that underlies human real-time sentence processing.
arXiv.org Artificial Intelligence
Oct-21-2024
- Country:
- Asia
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Singapore (0.05)
- Middle East > UAE
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Tuscany
- Florence (0.05)
- Belgium > Brussels-Capital Region
- North America > United States
- Maryland > Prince George's County
- College Park (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Maryland > Prince George's County
- Asia
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Consumer Products & Services > Restaurants (0.47)
- Technology: