Rational Retrieval Acts: Leveraging Pragmatic Reasoning to Improve Sparse Retrieval
Satouf, Arthur, Zenou, Gabriel Ben, Piwowarski, Benjamin, Boubacar, Habiboulaye Amadou, Piantanida, Pablo
–arXiv.org Artificial Intelligence
Current sparse neural information retrieval (IR) methods, and to a lesser extent more traditional models such as BM25, do not take into account the document collection and the complex interplay between different term weights when representing a single document. In this paper, we show how the Rational Speech Acts (RSA), a linguistics framework used to minimize the number of features to be communicated when identifying an object in a set, can be adapted to the IR case -- and in particular to the high number of potential features (here, tokens). RSA dynamically modulates token-document interactions by considering the influence of other documents in the dataset, better contrasting document representations. Experiments show that incorporating RSA consistently improves multiple sparse retrieval models and achieves state-of-the-art performance on out-of-domain datasets from the BEIR benchmark. https://github.com/arthur-75/Rational-Retrieval-Acts
arXiv.org Artificial Intelligence
May-8-2025
- Country:
- Asia > Middle East
- Saudi Arabia > Asir Province > Abha (0.04)
- Europe
- France (0.05)
- Italy (0.05)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- North America
- Canada > Quebec (0.04)
- Dominican Republic (0.04)
- United States
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Middlesex County
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Technology: