Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

Feb-13-2023–arXiv.org Artificial Intelligence

Recent work in information retrieval finds that representing queries and documents with multiple vectors yields bi-encoder retrievers that are robust on out-of-distribution datasets. In this paper, we explore whether late interaction, the simplest form of multi-vector representation, also helps neural rerankers that use only the [CLS] vector to compute the similarity score. Although, intuitively, the reranker's attention mechanism already gathers token-level information in earlier layers, we find that adding late interaction still brings an extra 5% improvement on average on out-of-distribution datasets, with little increase in latency and no degradation in in-domain effectiveness. Through extensive experiments and analysis, we show that this finding is consistent across model sizes and across first-stage retrievers of diverse natures, and that the improvement is more prominent on longer queries.
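To make the idea of late interaction concrete, the following is a minimal sketch of a ColBERT-style MaxSim score over token vectors, added on top of a [CLS]-based relevance score. The abstract does not specify how the two signals are combined, so the plain sum in `combined_score` is an assumption for illustration only, and the function names and toy vectors are hypothetical. In a real reranker the token vectors would come from the final-layer hidden states of the model.

```python
import math


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs, doc_vecs):
    """ColBERT-style late interaction.

    For each query token vector, take the maximum cosine similarity
    over all document token vectors, then sum over query tokens.
    """
    q = [normalize(v) for v in query_vecs]
    d = [normalize(v) for v in doc_vecs]
    return sum(max(dot(qv, dv) for dv in d) for qv in q)


def combined_score(cls_score, query_vecs, doc_vecs):
    """Add the late-interaction score to the [CLS]-based score.

    Assumption: an unweighted sum. The abstract does not state the
    actual combination used in the paper.
    """
    return cls_score + maxsim_score(query_vecs, doc_vecs)


# Toy 2-dimensional token vectors (hypothetical values).
query_vecs = [[1.0, 0.0], [0.0, 1.0]]
doc_vecs = [[1.0, 0.0], [1.0, 1.0], [0.0, -1.0]]

late = maxsim_score(query_vecs, doc_vecs)
total = combined_score(0.5, query_vecs, doc_vecs)
print(late, total)
```

Because each query token contributes its own best match, a query with more tokens contributes more terms to the sum. This may be one intuition for why the reported gain is larger on longer queries, though the abstract does not state the cause.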

information retrieval, machine learning, natural language

Country:
- North America
  - United States
    - District of Columbia > Washington (0.05)
    - Texas > Harris County
      - Houston (0.04)
    - New York > New York County
      - New York City (0.05)
    - Indiana > Marion County
      - Indianapolis (0.04)
  - Canada
    - Ontario > Waterloo Region
      - Waterloo (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - Ireland (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia > Japan
  - Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Information Retrieval (1.00)
  - Machine Learning (1.00)
