Saving Dense Retriever from Shortcut Dependency in Conversational Search
–arXiv.org Artificial Intelligence
Conversational search (CS) needs a holistic understanding of conversational inputs to retrieve relevant passages. In this paper, we demonstrate the existence of a retrieval shortcut in CS, which causes models to retrieve passages solely relying on partial history while disregarding the latest question. With in-depth analysis, we first show that naively trained dense retrievers heavily exploit the shortcut and hence perform poorly when asked to answer history-independent questions. To build more robust models against shortcut dependency, we explore various hard negative mining strategies. Experimental results show that training with the model-based hard negatives (Xiong et al., 2020) effectively mitigates the dependency on the shortcut, significantly improving dense retrievers on recent CS benchmarks. In particular, our retriever outperforms the previous state-of-the-art model by Figure 1: An example of a retrieval shortcut in conversational 11.0 in Recall@10 on QReCC (Anantha et al., search. While we expect the retriever to predict 2021).
arXiv.org Artificial Intelligence
Oct-19-2022
- Country:
- Europe
- Switzerland (0.04)
- Russia (0.04)
- Monaco (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Greece > Attica
- Athens (0.04)
- Asia
- Russia (0.04)
- Middle East > Jordan (0.04)
- Europe
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Media > Film (0.68)
- Leisure & Entertainment (0.68)
- Technology: