Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
Trabelsi, Firas, Vilar, David, Finkelstein, Mara, Freitag, Markus
–arXiv.org Artificial Intelligence
Minimum Bayes Risk (MBR) decoding is a powerful decoding strategy widely used for text generation tasks, but its quadratic computational complexity limits its practical application. This paper presents a novel approach for approximating MBR decoding using matrix completion techniques, focusing on the task of machine translation. We formulate MBR decoding as a matrix completion problem, where the utility metric scores between candidate hypotheses and pseudo-reference translations form a low-rank matrix. First, we empirically show that the scores matrices indeed have a low-rank structure. Then, we exploit this by only computing a random subset of the scores and efficiently recover the missing entries in the matrix by applying the Alternating Least Squares (ALS) algorithm, thereby enabling a fast approximation of the MBR decoding process. Our experimental results on machine translation tasks demonstrate that the proposed method requires 1/16 utility metric computations compared to vanilla MBR decoding while achieving equal translation quality measured by COMET22 on the WMT22 dataset (en<>de and en<>ru). We also benchmark our method against other approximation methods and we show gains in quality when comparing to them.
arXiv.org Artificial Intelligence
Jun-4-2024
- Country:
- North America > United States
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- New York > New York County
- New York City (0.04)
- California > San Francisco County
- San Francisco (0.04)
- Washington > King County
- Europe
- Asia
- Singapore (0.05)
- China > Hong Kong (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.05)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.68)
- Technology: