mFACE: Multilingual Summarization with Factual Consistency Evaluation
Roee Aharoni, Shashi Narayan, Joshua Maynez, Jonathan Herzig, Elizabeth Clark, Mirella Lapata
arXiv.org Artificial Intelligence
Abstractive summarization has enjoyed renewed interest in recent years, thanks to pre-trained language models and the availability of large-scale datasets. Despite promising results, current models still generate factually inconsistent summaries, which reduces their utility in real-world applications. Several recent efforts attempt to address this by devising models that automatically detect factual inconsistencies in machine-generated summaries. However, they focus exclusively on English, a language with abundant resources. In this work, we leverage factual consistency evaluation models to improve multilingual summarization. We explore two intuitive approaches to mitigate hallucinations based on the signal provided by a multilingual NLI model, namely data filtering and controlled generation. Experimental results on the 45 languages from the XLSum dataset show gains over strong baselines in both automatic and human evaluation.
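The two approaches named in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the function names, the `0.5` threshold, the `<faithful>` / `<unfaithful>` control tokens, and in particular `toy_score`, a crude token-overlap placeholder standing in for a real multilingual NLI model that would score whether the document entails the summary.

```python
from typing import Callable, List, Tuple

Pair = Tuple[str, str]                 # (document, summary)
ScoreFn = Callable[[str, str], float]  # entailment-style score in [0, 1]


def toy_score(document: str, summary: str) -> float:
    """Placeholder scorer (assumption): fraction of summary tokens found in
    the document. A real setup would call a multilingual NLI model here."""
    doc_tokens = set(document.lower().split())
    summary_tokens = summary.lower().split()
    if not summary_tokens:
        return 0.0
    return sum(t in doc_tokens for t in summary_tokens) / len(summary_tokens)


def filter_by_score(pairs: List[Pair], score_fn: ScoreFn,
                    threshold: float = 0.5) -> List[Pair]:
    """Data filtering: keep only training pairs whose summary scores at or
    above the (hypothetical) threshold."""
    return [(d, s) for d, s in pairs if score_fn(d, s) >= threshold]


def add_control_tokens(pairs: List[Pair], score_fn: ScoreFn,
                       threshold: float = 0.5) -> List[Pair]:
    """Controlled generation: keep every pair, but prefix the input with a
    (hypothetical) token recording whether the summary looks consistent.
    At inference time the input would be prefixed with the faithful token."""
    tagged = []
    for d, s in pairs:
        token = "<faithful>" if score_fn(d, s) >= threshold else "<unfaithful>"
        tagged.append((f"{token} {d}", s))
    return tagged


if __name__ == "__main__":
    data = [
        ("the cat sat on the mat", "the cat sat"),
        ("the cat sat on the mat", "a dog barked loudly"),
    ]
    print(filter_by_score(data, toy_score))
    print(add_control_tokens(data, toy_score))
```

The design difference is that filtering discards low-scoring training pairs, while the control-token variant keeps all of the data and lets the model condition on the consistency signal.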
Jan-5-2024
- Genre:
- Research Report > New Finding (0.68)