Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers

Pezzuti, Francesca, MacAvaney, Sean, Tonellotto, Nicola

Mar-28-2025–arXiv.org Artificial Intelligence

State-of-the-art cross-encoders can be fine-tuned to be highly effective in passage re-ranking. The typical fine-tuning process of cross-encoders as re-rankers requires large amounts of manually labelled data, a contrastive learning objective, and a set of heuristically sampled negatives. An alternative recent approach for fine-tuning instead involves teaching the model to mimic the rankings of a highly effective large language model using a distillation objective. These fine-tuning strategies can be applied either individually, or in sequence. In this work, we systematically investigate the effectiveness of point-wise cross-encoders when fine-tuned independently in a single stage, or sequentially in two stages. Our experiments show that the effectiveness of point-wise cross-encoders fine-tuned using contrastive learning is indeed on par with that of models fine-tuned with multi-stage approaches. Code is available for reproduction at https://github.com/fpezzuti/multistage-finetuning.

effectiveness, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Mar-28-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Italy > Tuscany > Pisa Province > Pisa (0.40)

Genre:
- Research Report
  - Experimental Study > Negative Result (0.46)
  - New Finding (0.94)

Industry:
- Government > Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Large Language Model (0.49)