It Takes Two: A Dual Stage Approach for Terminology-Aware Translation
–arXiv.org Artificial Intelligence
This paper introduces DuTerm, a novel two-stage architecture for terminology-constrained machine translation. Our system combines a terminology-aware NMT model, adapted via fine-tuning on large-scale synthetic data, with a prompt-based LLM for post-editing. The LLM stage refines NMT output and enforces terminology adherence. We evaluate DuTerm on English-to German, English-to-Spanish, and English-to-Russian with the WMT 2025 Terminology Shared Task corpus. We demonstrate that flexible, context-driven terminology handling by the LLM consistently yields higher quality translations than strict constraint enforcement. Our results highlight a critical trade-off, revealing that an LLM's work best for high-quality translation as context-driven mutators rather than generators.
arXiv.org Artificial Intelligence
Nov-12-2025
- Country:
- Asia
- China (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- Europe
- North America
- Canada > Ontario
- Toronto (0.04)
- United States > Pennsylvania
- Philadelphia County > Philadelphia (0.04)
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (0.49)
- Technology: