DiscoTK: Using Discourse Structure for Machine Translation Evaluation

Joty, Shafiq; Guzman, Francisco; Marquez, Lluis; Nakov, Preslav

Nov-28-2019–arXiv.org Artificial Intelligence

We present novel automatic metrics for machine translation evaluation that use discourse structure and convolution kernels to compare the discourse tree of an automatic translation with that of the human reference. We experiment with five transformations and augmentations of a base discourse tree representation based on Rhetorical Structure Theory, and we combine the kernel scores for each of them into a single score. Finally, we add other metrics from the ASIYA MT evaluation toolkit, and we tune the weights of the combination on actual human judgments. Experiments on the WMT12 and WMT13 metrics shared task datasets show correlation with human judgments that exceeds that of the best systems that participated in those years, at both the segment and the system level.
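To make the idea of comparing two discourse trees with a convolution kernel more concrete, here is a minimal sketch of a subset-tree kernel in the style of Collins and Duffy. It is an illustration under stated assumptions, not the authors' implementation: the tuple-based tree encoding, the helper `t`, the function names, and the toy labels (`Elaboration`, `Contrast`, `Nucleus`, `Satellite`, `EDU`) are all hypothetical, and the five tree transformations, the ASIYA metrics, and the weight tuning described in the abstract are not shown.

```python
import math

def t(label, *children):
    """Hypothetical helper: build a tree node as a (label, children) pair."""
    return (label, children)

def nodes(tree):
    """Yield every node of the tree in pre-order."""
    yield tree
    for child in tree[1]:
        yield from nodes(child)

def production(node):
    """A node's label together with the ordered labels of its children."""
    return (node[0], tuple(child[0] for child in node[1]))

def common_fragments(n1, n2, decay=1.0):
    """Weighted count of tree fragments rooted at both n1 and n2."""
    if not n1[1] or not n2[1]:
        return 0.0  # a leaf roots no fragment
    if production(n1) != production(n2):
        return 0.0  # different expansions share no fragment at this node
    score = decay
    for c1, c2 in zip(n1[1], n2[1]):
        score *= 1.0 + common_fragments(c1, c2, decay)
    return score

def tree_kernel(t1, t2, decay=1.0):
    """Sum the shared-fragment counts over all pairs of nodes."""
    return sum(common_fragments(a, b, decay)
               for a in nodes(t1) for b in nodes(t2))

def normalized_kernel(t1, t2, decay=1.0):
    """Scale the kernel to [0, 1] so trees of different sizes are comparable."""
    denom = math.sqrt(tree_kernel(t1, t1, decay) * tree_kernel(t2, t2, decay))
    return tree_kernel(t1, t2, decay) / denom if denom else 0.0

# Toy discourse trees (labels are illustrative assumptions).
reference = t("Elaboration", t("Nucleus", t("EDU")), t("Satellite", t("EDU")))
hypothesis = t("Contrast", t("Nucleus", t("EDU")), t("Nucleus", t("EDU")))

print(normalized_kernel(reference, reference))
print(normalized_kernel(reference, hypothesis))
```

In this sketch a translation whose discourse tree matches the reference exactly scores 1.0, and the score drops as fewer tree fragments are shared. Normalizing by the self-similarity of each tree is one common design choice for keeping larger trees from dominating the score.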

metric, representation, translation


Country:
- North America
  - Canada (0.04)
  - United States
    - Maryland > Baltimore (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California
      - San Francisco County > San Francisco (0.14)
      - San Diego County > San Diego (0.04)
- Europe
  - Czechia > Prague (0.05)
  - United Kingdom > Scotland
    - City of Edinburgh > Edinburgh (0.04)
  - Italy > Trentino-Alto Adige/Südtirol
    - Trentino Province > Trento (0.04)
- Asia
  - Middle East > Qatar (0.04)
  - South Korea (0.04)
  - Japan > Hokkaidō
    - Hokkaidō Prefecture > Sapporo (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
