xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

Neural Information Processing Systems

This paper introduces xRAG, a context compression method for retrieval-augmented generation that reduces the retrieved context to a single token.