Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
Wotherspoon, Shannon, Hartmann, William, Snover, Matthew
–arXiv.org Artificial Intelligence
This paper introduces a set of English translations for a 123-hour subset of the CallHome Mandarin Chinese data and the HKUST Mandarin Telephone Speech data for the task of speech translation. Paired source-language speech and target-language text is essential for training end-to-end speech translation systems and can provide substantial performance improvements for cascaded systems as well, relative to training on more widely available text data sets. We demonstrate that fine-tuning a general-purpose translation model to our Mandarin-English conversational telephone speech training set improves target-domain BLEU by more than 8 points, highlighting the importance of matched training data.
arXiv.org Artificial Intelligence
Mar-25-2024
- Country:
- North America > United States
- Asia
- Middle East > Jordan (0.05)
- China (0.05)
- Genre:
- Research Report (0.40)
- Technology:
- Information Technology > Artificial Intelligence
- Speech > Speech Recognition (1.00)
- Natural Language > Machine Translation (1.00)
- Machine Learning (1.00)
- Information Technology > Artificial Intelligence