Augmenting Language Models with Long-Term Memory

Apr-30-2026, 04:54:44 GMT–Neural Information Processing Systems

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models Augmented with Long-Term Memory (LONGMEM), which enables LLMs to memorize long history. We design a novel decoupled network architecture with the original backbone LLM frozen as a memory encoder and an adaptive residual side-network as a memory retriever and reader. Such a decoupled memory design can easily cache and update long-term past contexts for memory retrieval without suffering from memory staleness. Enhanced with memory-augmented adaptation training, LONGMEM can thus memorize long past context and use long-term memory for language modeling.

large language model, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Apr-30-2026, 04:54:44 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.47)

Duplicate Docs Excel Report

Title
Augmenting Language Models with Long-Term Memory Weizhi Wang, Li Dong

Similar Docs Excel Report more

Title	Similarity	Source
None found