Long Context Modeling with Ranked Memory-Augmented Retrieval

Alselwi, Ghadir, Xue, Hao, Jameel, Shoaib, Suleiman, Basem, Salim, Flora D., Razzak, Imran

arXiv.org Artificial Intelligence 

Effective long-term memory management is crucial for language models handling extended contexts. We introduce a novel framework that dynamically ranks memory entries based on relevance. Unlike previous works, our model introduces a novel relevance scoring and a pointwise re-ranking model for key-value embeddings, inspired by learning-to-rank techniques in information retrieval. Enhanced Ranked Memory Augmented Retrieval ERMAR achieves state-of-the-art results on standard benchmarks.