MeMo: Towards Language Models with Associative Memory Mechanisms

Zanzotto, Fabio Massimo, Ruzzetti, Elena Sofia, Xompero, Giancarlo A., Ranaldi, Leonardo, Venditti, Davide, Ranaldi, Federico, Giannone, Cristina, Favalli, Andrea, Romagnoli, Raniero

arXiv.org Artificial Intelligence 

Memorization is a fundamental ability of Transformer-based Large Language Models, achieved through learning. In this paper, we propose a paradigm shift by designing an architecture to memorize text directly, following the principle that memorization precedes learning. We introduce MeMo, a novel architecture for language modeling that explicitly memorizes sequences of tokens in layered associative memories. By design, MeMo offers transparency and the possibility of model editing, including forgetting texts. We experiment with the MeMo architecture, showing the memorization power of both one-layer and multi-layer configurations.
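The abstract does not detail how the layered associative memories work, but the classical building block behind such architectures is a linear associative (correlation-matrix) memory: key–value pairs are stored by accumulating outer products, retrieved by a single matrix product, and individually deleted by subtracting the same outer product, which is what makes explicit forgetting possible. The sketch below illustrates that general mechanism only; the dimension `d`, the random unit-vector encodings, and the pair count are illustrative assumptions, not MeMo's actual design.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 512       # embedding dimension (assumption, not taken from the paper)
n_pairs = 50  # number of stored associations (assumption)

def random_unit(n, dim):
    """Random unit vectors; in high dimension they are nearly orthogonal."""
    v = rng.standard_normal((n, dim))
    return v / np.linalg.norm(v, axis=1, keepdims=True)

keys = random_unit(n_pairs, d)     # e.g. encodings of token contexts
values = random_unit(n_pairs, d)   # e.g. encodings of the next token

# Memorize: accumulate outer products in one shot, with no gradient learning.
W = values.T @ keys                # (d, d) correlation-matrix memory

# Retrieve: multiplying by a stored key approximately recovers its value,
# up to cross-talk from the other near-orthogonal keys.
retrieved = keys @ W.T             # row i approximates values[i]
cos = np.sum(retrieved * values, axis=1) / np.linalg.norm(retrieved, axis=1)
print(round(float(cos.min()), 3))  # worst-case retrieval fidelity

# Forget: subtracting the same outer product deletes one association
# without retraining, mirroring the editability the abstract describes.
W -= np.outer(values[0], keys[0])
```

Retrieval degrades gracefully as more pairs are packed into the matrix, which is why such memories are typically layered or combined with high-dimensional encodings.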
