Multilingual Dialogue Generation and Localization with Dialogue Act Scripting
Vasselli, Justin, Kardinata, Eunike Andriani, Sakai, Yusuke, Watanabe, Taro
–arXiv.org Artificial Intelligence
Non-English dialogue datasets are scarce, and models are often trained or evaluated on translations of English-language dialogues, an approach which can introduce artifacts that reduce their naturalness and cultural appropriateness. This work proposes Dialogue Act Script (DAS), a structured framework for encoding, localizing, and generating multilingual dialogues from abstract intent representations. Rather than translating dialogue utterances directly, DAS enables the generation of new dialogues in the target language that are culturally and contextually appropriate. By using structured dialogue act representations, DAS supports flexible localization across languages, mitigating translationese and enabling more fluent, naturalistic conversations. Human evaluations across Italian, German, and Chinese show that DAS-generated dialogues consistently outperform those produced by both machine and human translators on measures of cultural relevance, coherence, and situational appropriateness.
arXiv.org Artificial Intelligence
Sep-29-2025
- Country:
- Asia
- Europe
- Belgium > Flanders
- East Flanders > Ghent (0.04)
- Czechia > Prague (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Belgium > Flanders
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Washington > King County
- Seattle (0.04)
- Louisiana > Orleans Parish
- Canada > Ontario
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Education > Educational Setting (0.46)
- Technology: