Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics

Michaelov, James A., Arnett, Catherine, Bergen, Benjamin K.

Apr-29-2024–arXiv.org Artificial Intelligence

Transformers have supplanted Recurrent Neural Networks as the dominant architecture for both natural language processing tasks and, despite criticisms of cognitive implausibility, for modelling the effect of predictability on online human language comprehension. However, two recently developed recurrent neural network architectures, RWKV and Mamba, appear to perform natural language tasks comparably to or better than transformers of equivalent scale. In this paper, we show that contemporary recurrent models are now also able to match - and in some cases, exceed - performance of comparably sized transformers at modeling online human language comprehension. This suggests that transformer language models are not uniquely suited to this task, and opens up new directions for debates about the extent to which architectural features of language models make them better or worse models of human language comprehension.

dataset, federmeier, language comprehension, (14 more...)

arXiv.org Artificial Intelligence

Apr-29-2024

arXiv.org PDF

Add feedback

Country:
- South America > Paraguay
  - Asunción > Asunción (0.04)
- North America > United States
  - Utah > Salt Lake County
    - Salt Lake City (0.04)
  - Massachusetts > Suffolk County
    - Boston (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - San Diego County
      - San Diego (0.04)
      - La Jolla (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Italy > Tuscany
    - Florence (0.04)
  - Hungary > Budapest
    - Budapest (0.04)
- Asia
  - Singapore (0.04)
  - Indonesia > Bali (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
  - Japan
    - Kyūshū & Okinawa > Kyūshū
      - Miyazaki Prefecture > Miyazaki (0.04)
    - Honshū > Chūbu
      - Toyama Prefecture > Toyama (0.04)

Genre:
- Research Report
  - New Finding (0.93)
  - Experimental Study (0.67)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found