xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
Xun Wang
