Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
–arXiv.org Artificial Intelligence
Document-level text generation tasks are known to be more difficult than sentence-level text generation tasks as they require the understanding of longer context to generate high-quality texts. In this paper, we investigate the adaption of Minimum Bayes Risk (MBR) decoding for document-level text generation tasks. MBR decoding makes use of a utility function to estimate the output with the highest expected utility from a set of candidate outputs. Although MBR decoding is shown to be effective in a wide range of sentence-level text generation tasks, its performance on document-level text generation tasks is limited as many of the utility functions are designed for evaluating the utility of sentences. To this end, we propose MBR-OT, a variant of MBR decoding using Wasserstein distance to compute the utility of a document using a sentence-level utility function. The experimental result shows that the performance of MBR-OT outperforms that of the standard MBR in document-level machine translation, text simplification, and dense image captioning tasks. Our code is available at https://github.com/jinnaiyuu/mbr-optimal-transport
arXiv.org Artificial Intelligence
May-30-2025
- Country:
- Africa > Mali (0.04)
- Asia
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- France > Hauts-de-France
- Monaco (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Switzerland (0.04)
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California > Los Angeles County
- Los Angeles > Hollywood
- West Hollywood (0.04)
- Santa Monica (0.04)
- Los Angeles > Hollywood
- Florida > Miami-Dade County
- Miami (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Washington > King County
- Seattle (0.04)
- California > Los Angeles County
- Canada > Ontario
- Genre:
- Research Report > New Finding (0.48)
- Technology: