Improving Metrics for Speech Translation

Paonessa, Claudio, Frefel, Dominik, Vogel, Manfred

May-22-2023–arXiv.org Artificial Intelligence

We introduce Parallel Paraphrasing ($\text{Para}_\text{both}$), an augmentation method for translation metrics that automatically paraphrases both the reference and the hypothesis. The method counteracts the typically misleading results that speech translation metrics such as WER, CER, and BLEU produce when only a single reference is available. We also introduce two new datasets created explicitly to measure the quality of metrics intended for Swiss German speech-to-text systems. Based on these datasets, we show that applying our method to commonly used metrics significantly improves their correlation with human quality perception.
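The abstract does not spell out how the paraphrases are combined with a metric, so the following is only a minimal sketch of one plausible reading: given a set of paraphrased variants of the reference and of the hypothesis, score every pair with WER and keep the best (lowest) value. The function names, the choice of WER, and the minimum as the aggregation rule are all assumptions made for illustration. The paraphrases are passed in as plain lists, since the paraphrasing model itself is not described here.

```python
from itertools import product


def word_edit_distance(ref_words, hyp_words):
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate of a hypothesis against a single reference."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return word_edit_distance(ref_words, hyp_words) / len(ref_words)


def min_wer_over_paraphrases(ref_variants, hyp_variants):
    """Hypothetical aggregation: lowest WER over all (reference, hypothesis)
    variant pairs. Taking the minimum is an assumption, not the paper's
    confirmed rule."""
    return min(wer(r, h) for r, h in product(ref_variants, hyp_variants))


# Illustrative, made-up sentences: the hypothesis is an acceptable rewording.
single_ref_score = wer("he came home late", "he arrived home late")
augmented_score = min_wer_over_paraphrases(
    ["he came home late", "he arrived home late"],  # reference + paraphrase
    ["he arrived home late"],                       # hypothesis variants
)
```

With a single reference, the reworded hypothesis is penalized for one substituted word out of four. Once a matching paraphrase of the reference is among the variants, the penalty disappears, which is the kind of single-reference bias the abstract describes.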

artificial intelligence, computational linguistic, natural language

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Machine Translation (1.00)
  - Speech > Speech Recognition (0.93)
