Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Zong, Yuxuan, Piwowarski, Benjamin
–arXiv.org Artificial Intelligence
Late interaction neural IR models like ColBERT offer a competitive effectiveness-efficiency trade-off across many benchmarks. However, they require a huge memory space to store the contextual representation for all the document tokens. Some works have proposed using either heuristics or statistical-based techniques to prune tokens from each document. This however doesn't guarantee that the removed tokens have no impact on the retrieval score. Our work uses a principled approach to define how to prune tokens without impacting the score between a document and a query. We introduce three regularization losses, that induce a solution with high pruning ratios, as well as two pruning strategies. We study them experimentally (in and out-domain), showing that we can preserve ColBERT's performance while using only 30\% of the tokens.
arXiv.org Artificial Intelligence
Apr-18-2025
- Country:
- Africa > Cameroon
- Far North Region > Maroua (0.04)
- Asia
- Europe
- France > Île-de-France
- Ireland
- Leinster > County Dublin
- Dublin (0.04)
- Munster > County Limerick
- Limerick (0.04)
- Leinster > County Dublin
- Italy (0.05)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Spain > Galicia
- Madrid (0.04)
- Switzerland (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Dominican Republic (0.04)
- United States
- Arizona > Maricopa County
- Phoenix (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Indiana > Marion County
- Indianapolis (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York > New York County
- New York City (0.05)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- Arizona > Maricopa County
- Canada
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Africa > Cameroon
- Genre:
- Research Report (1.00)
- Technology: