Evaluation of NMT-Assisted Grammar Transfer for a Multi-Language Configurable Data-to-Text System
Madsack, Andreas; Heininger, Johanna; Schneider, Adela; Chen, Ching-Yi; Eckard, Christian; Weißgraeber, Robert
arXiv.org Artificial Intelligence
One approach to multilingual data-to-text generation is to translate grammatical configurations upfront from the source language into each target language. These configurations are then used by the surface realizer and in the document planning stages to generate output. In this paper, we describe a rule-based NLG implementation of this approach in which the configuration is translated by Neural Machine Translation (NMT) combined with a one-time human review. We also introduce a cross-language grammar dependency model to create a multilingual NLG system that generates text from the source data, so that the generation phase scales without a human in the loop. Additionally, we introduce a method for human post-editing evaluation of the automatically translated text. Our evaluation on the SportSett:Basketball dataset shows that our NLG system performs well, underlining the grammatical correctness of its output in translation tasks.
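The workflow described in the abstract (translate the configuration once, review it once, then generate from data with no further human involvement) can be pictured with a minimal Python sketch. Everything in it is an illustrative assumption and not the authors' implementation: the `Template` structure, the function names `stub_translate`, `transfer`, `approve` and `generate`, the example sentence, and the data record are all hypothetical. The NMT step is replaced by a lookup table, and the cross-language grammar dependency model is not represented at all.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    """One configurable sentence template (hypothetical structure)."""
    language: str
    text: str            # uses str.format placeholders such as {team}
    reviewed: bool = False


def stub_translate(text: str, target: str) -> str:
    """Stand-in for an NMT call: a lookup table, not a real translation API."""
    table = {
        ("{team} scored {points} points.", "de"):
            "{team} erzielte {points} Punkte.",
    }
    return table[(text, target)]


def transfer(source: Template, target: str) -> Template:
    """Translate the configuration once, upfront; the result is unreviewed."""
    return Template(target, stub_translate(source.text, target))


def approve(template: Template) -> Template:
    """One-time human review, modelled here as setting a flag."""
    return Template(template.language, template.text, reviewed=True)


def generate(template: Template, record: dict) -> str:
    """Generation phase: fills the template, no translation, no human."""
    if not template.reviewed:
        raise ValueError("template has not been reviewed")
    return template.text.format(**record)


# Configuration time: runs once per target language.
english = Template("en", "{team} scored {points} points.", reviewed=True)
german = approve(transfer(english, "de"))

# Generation time: runs for every data record.
record = {"team": "Chicago", "points": 102}
print(generate(english, record))
print(generate(german, record))
```

The point of the split is that `transfer` and `approve` run once per configuration, while `generate` runs once per data record, so the cost of translation and review does not grow with the volume of generated text.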
Jan-27-2025
- Country:
- South America > Chile
- North America > United States
- Illinois > Cook County > Chicago (0.07)
- Europe
- United Kingdom > Scotland
- City of Aberdeen > Aberdeen (0.04)
- Spain > Galicia
- A Coruña Province > Santiago de Compostela (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Greece > Attica
- Athens (0.04)
- Germany > Baden-Württemberg
- Stuttgart Region > Stuttgart (0.04)
- Asia
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment > Sports > Basketball (0.31)