Aspects of human memory and Large Language Models
arXiv.org Artificial Intelligence
Large Language Models (LLMs) are huge artificial neural networks which primarily serve to generate text, but also provide a very sophisticated probabilistic model of language use. Since generating a semantically consistent text requires a form of effective memory, we investigate the memory properties of LLMs and find surprising similarities with key characteristics of human memory. We argue that the human-like memory properties of the Large Language Model do not follow automatically from the LLM architecture but are rather learned from the statistics of the training textual data. These results strongly suggest that the biological features of human memory leave an imprint on the way that we structure our textual narratives.

Figure 1: Recall accuracy for a serial memory experiment with human subjects (sample data from [4]) and for a memorization experiment of a list of 20 facts of the has-a type for the Large Language Model GPT-J [5] studied
Nov-9-2023
- Country:
- Africa > Middle East
- Egypt > Cairo Governorate > Cairo (0.04)
- Asia
- India (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Philippines > Luzon
- National Capital Region > City of Manila (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Europe
- North America
- Cuba > La Habana Province
- Havana (0.04)
- Mexico (0.04)
- United States
- California > San Diego County
- San Diego (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Illinois > Cook County
- Chicago (0.04)
- New York (0.04)
- South America > Brazil (0.04)
- Genre:
- Research Report (0.70)
- Industry:
- Automobiles & Trucks > Manufacturer (0.46)
- Technology: