Evaluating the Efficacy of Length-Controllable Machine Translation
Cheng, Hao, Zhang, Meng, Wang, Weixuan, Li, Liangyou, Liu, Qun, Zhang, Zhihua
–arXiv.org Artificial Intelligence
Length-controllable machine translation is a type of constrained translation. It aims to contain the original meaning as much as possible while controlling the length of the translation. We can use automatic summarization or machine translation evaluation metrics for length-controllable machine translation, but this is not necessarily suitable and accurate. This work is the first attempt to evaluate the automatic metrics for length-controllable machine translation tasks systematically. We conduct a rigorous human evaluation on two translation directions and evaluate 18 summarization or translation evaluation metrics. We find that BLEURT and COMET have the highest correlation with human evaluation and are most suitable as evaluation metrics for length-controllable machine translation.
arXiv.org Artificial Intelligence
May-3-2023
- Country:
- Oceania > Australia (0.04)
- North America
- United States
- Pennsylvania (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- Germany > Berlin (0.04)
- Austria (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Research Report > Experimental Study (0.47)
- Technology: