Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous Decoding
Zeng, Hansi, Luo, Chen, Zamani, Hamed
–arXiv.org Artificial Intelligence
This paper introduces PAG-a novel optimization and decoding approach that guides autoregressive generation of document identifiers in generative retrieval models through simultaneous decoding. To this aim, PAG constructs a set-based and sequential identifier for each document. Motivated by the bag-of-words assumption in information retrieval, the set-based identifier is built on lexical tokens. The sequential identifier, on the other hand, is obtained via quantizing relevance-based representations of documents. Extensive experiments on MSMARCO and TREC Deep Learning Track data reveal that PAG outperforms the state-of-the-art generative retrieval model by a large margin (e.g., 15.6% MRR improvements on MS MARCO), while achieving 22x speed up in terms of query latency.
arXiv.org Artificial Intelligence
Apr-22-2024
- Country:
- North America > United States
- District of Columbia > Washington (0.05)
- New York > New York County
- New York City (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Europe
- Switzerland > Basel-City
- Basel (0.04)
- Spain > Galicia
- Madrid (0.04)
- Switzerland > Basel-City
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Taiwan > Taiwan Province
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Technology: