AITopics | Machine Translation

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering

Singh, Jasdeep, McCann, Bryan, Keskar, Nitish Shirish, Xiong, Caiming, Socher, Richard

arXiv.org Artificial Intelligence | May-27-2019

While natural language processing systems often focus on a single language, multilingual transfer learning has the potential to improve performance, especially for low-resource languages. We introduce XLDA, cross-lingual data augmentation, a method that replaces a segment of the input text with its translation in another language. XLDA enhances performance of all 14 tested languages of the cross-lingual natural language inference (XNLI) benchmark. With improvements of up to $4.8\%$, training with XLDA achieves state-of-the-art performance for Greek, Turkish, and Urdu. XLDA is in contrast to, and performs markedly better than, a more naive approach that aggregates examples in various languages in a way that each example is solely in one language. On the SQuAD question answering task, we see that XLDA provides a $1.0\%$ performance increase on the English evaluation set. Comprehensive experiments suggest that most languages are effective as cross-lingual augmentors, that XLDA is robust to a wide range of translation quality, and that XLDA is even more effective for randomly initialized models than for pretrained models.
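The core XLDA step described in this abstract, replacing one segment of the input with its translation, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `xlda_augment`, the precomputed `translations` lookup, and the sample sentences are all assumptions made for the example.

```python
import random

def xlda_augment(example, translations, languages, rng):
    """Replace one segment (premise or hypothesis) with its translation.

    example: dict with "premise", "hypothesis", and "label" keys.
    translations: hypothetical lookup {(text, lang): translated_text},
        assumed to be precomputed by some machine translation system.
    languages: candidate augmentor languages to sample from.
    rng: a random.Random instance, passed in for reproducibility.
    """
    segment = rng.choice(["premise", "hypothesis"])
    lang = rng.choice(languages)
    augmented = dict(example)  # shallow copy; the label is left untouched
    # Fall back to the original text if no translation is available.
    augmented[segment] = translations.get((example[segment], lang), example[segment])
    return augmented

# Illustrative data only (not taken from XNLI).
example = {
    "premise": "A man is playing a guitar.",
    "hypothesis": "A person makes music.",
    "label": "entailment",
}
translations = {
    ("A man is playing a guitar.", "de"): "Ein Mann spielt Gitarre.",
    ("A person makes music.", "de"): "Eine Person macht Musik.",
}
augmented = xlda_augment(example, translations, ["de"], random.Random(0))
```

The augmented example keeps its label and has exactly one segment in another language, in contrast to the naive approach the abstract mentions, where each training example stays entirely in one language.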

artificial intelligence, machine translation, natural language, (16 more...)

arXiv.org Artificial Intelligence

1905.11471

Genre: Research Report (1.00)

Industry: Leisure & Entertainment (0.47)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Microsoft Research Asia (MSRA) Leads in 2019 WMT International Machine Translation Competition

#artificialintelligence | May-23-2019, 05:17:48 GMT

Microsoft Research Asia (MSRA) has achieved eight top places in the recent machine translation challenge organized by the 2019 fourth Conference on Machine Translation (WMT19), out of the eleven tasks it undertook. Overall, there are nineteen machine translation categories in WMT this year. MSRA achieved first place in machine translation tasks for Chinese-English, English-Finnish, English-German, English-Lithuanian, French-German, German-English, German-French and Russian-English. Its three other entries placed second in their respective categories: English-Kazakh, Finnish-English and Lithuanian-English. As one of the leading machine translation competitions globally, WMT is a platform for leading researchers to demonstrate their solutions, as well as to understand the continuous evolution of machine translation technology. Now in its 14th year, more than 50 teams globally from technology companies, leading academic institutions and universities participated in a bid to demonstrate their machine translation capabilities.

artificial intelligence, microsoft research asia, natural language, (10 more...)

#artificialintelligence

Country: Asia (0.62)

Industry: Information Technology (0.60)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Google's AI can now translate your speech while keeping your voice

#artificialintelligence | May-21-2019, 10:24:13 GMT

The new system, dubbed the Translatotron, has three components, all of which look at the speaker's audio spectrogram--a visual snapshot of the frequencies used when the sound is playing, often called a voiceprint. The first component uses a neural network trained to map the audio spectrogram in the input language to the audio spectrogram in the output language. The second converts the spectrogram into an audio wave that can be played.

artificial intelligence, machine learning, natural language, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.35)

Google AI 'Translatotron' Can Make Anyone a Real-Time Polyglot

#artificialintelligence | May-20-2019, 06:22:40 GMT

Google AI yesterday released its latest research result in speech-to-speech translation, the futuristic-sounding "Translatotron." Billed as the world's first end-to-end speech-to-speech translation model, Translatotron promises the potential for real-time cross-linguistic conversations with low latency and high accuracy. Humans have always dreamed of a voice-based device that could enable them to simply leap over language barriers. While advances in deep learning have contributed to highly improved accuracy in speech recognition and machine translation, smooth conversations between speakers of different languages have remained hampered by unnatural pauses during machine processing. Google's wireless headphones, the Pixel Buds, released in 2017, boasted real-time speech translation, but users found the practical experience less than satisfying.

machine learning, natural language, translatotron, (16 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.38)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)

Amazing Google AI speaks another language in your voice

#artificialintelligence | May-19-2019, 18:35:29 GMT

On Wednesday, Google unveiled Translatotron, an in-development speech-to-speech translation system. It's not the first system to translate speech from one language to another, but Google designed Translatotron to do something other systems can't: retain the original speaker's voice in the translated audio. In other words, the tech could make it sound like you're speaking a language you don't know -- a remarkable step forward on the path to breaking down the global language barrier. According to Google's AI blog, most speech-to-speech translation systems follow a three-step process. First they transcribe the speech.

amazing google ai speak, speech-to-speech translation system, translatotron, (6 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Google's new AI can help you speak another language in your own voice

#artificialintelligence | May-19-2019, 18:34:10 GMT

Google Translate is one of the company's most used products. It helps people translate one language to another through typing, taking pics of text, and using speech-to-text technology. Now, the company's launching a new project called Translatotron, which will offer direct speech-to-speech translations – without even using any text. In a post on Google's AI blog, the team behind the tool explained that instead of using speech-to-text and then text-to-speech to convert voice, it relied on a new model (which runs on a neural network) to develop the new system.

artificial intelligence, machine translation, natural language, (9 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.82)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.59)

Adaptive Attention Span in Transformers

Sukhbaatar, Sainbayar, Grave, Edouard, Bojanowski, Piotr, Joulin, Armand

arXiv.org Machine Learning | May-19-2019

Part of its success is due to its ability to capture long term dependencies. This is achieved by taking long sequences as inputs and explicitly compute the relations between every token via a mechanism called the "self-attention" layer (Al-Rfou et al., 2019). [...] model called Sequential Transformer (Vaswani et al., 2017). A Transformer is made of a sequence of layers that are composed of a block of parallel self-attention layers followed by a feedforward network. We refer to Vaswani [...]
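The self-attention layer named in this excerpt can be illustrated with a small sketch of single-head scaled dot-product attention in the style of Vaswani et al. (2017). It is deliberately simplified and is not the adaptive-span method of the paper: the function name `self_attention` is hypothetical, and queries, keys, and values are all taken to be the raw input vectors, with no learned projections.

```python
import math

def self_attention(x):
    """Single-head scaled dot-product self-attention with Q = K = V = x.

    x: list of token vectors (lists of floats), all of the same dimension.
    Returns one output vector per token, each a weighted average of all
    input vectors, so every token attends to every other token.
    """
    d = len(x[0])
    outputs = []
    for query in x:
        # Similarity of this token to every token, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(query, key)) / math.sqrt(d) for key in x]
        # Numerically stable softmax over the scores.
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the value vectors.
        outputs.append([sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)])
    return outputs

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens)
```

Because each token is compared against every other token, the cost grows quadratically with sequence length, which is the motivation for limiting the attention span.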

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

1905.07799

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.32)

A Case Study: Exploiting Neural Machine Translation to Translate CUDA to OpenCL

Kim, Yonghae, Kim, Hyesoon

arXiv.org Machine Learning | May-18-2019

The sequence-to-sequence (seq2seq) model for neural machine translation has significantly improved the accuracy of language translation. There have been new efforts to use this seq2seq model for program language translation or program comparisons. In this work, we present the detailed steps of using a seq2seq model to translate CUDA programs to OpenCL programs, which both have very similar programming styles. Our work shows (i) a training input set generation method, (ii) pre/post processing, and (iii) a case study using Polybench-gpu-1.0, NVIDIA SDK, and Rodinia benchmarks.

api usage, artificial intelligence, natural language, (15 more...)

arXiv.org Machine Learning

1905.07653

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

An Emotion Detection System for Cantonese

Lee, John (City University of Hong Kong)

AAAI Conferences | May-15-2019

We present the first automatic emotion detection system for Cantonese. This system classifies input text into eight emotion classes: expectancy, joy, love, surprise, anxiety, sorrow, angry, or hate. While a number of emotion corpora and lexica for Mandarin Chinese have been developed, no emotion dataset is available for Cantonese. We leverage existing Mandarin Chinese emotion resources to build the system, with support from Cantonese-Mandarin lexical mappings from a machine translation system, as well as English-Mandarin lexical mappings to handle code-switching in Cantonese input. Evaluation on a set of Cantonese sentences from social media shows promising results.
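The idea of reusing Mandarin emotion resources through lexical mappings can be sketched as a lookup pipeline. Everything in this example is an assumption for illustration: the lexicon entries are toy data, the function name `detect_emotion` is hypothetical, and the majority-vote rule is a stand-in, since the abstract does not state how the system combines evidence.

```python
from collections import Counter

# Toy, hypothetical entries; a real system would use full lexical resources.
CANTONESE_TO_MANDARIN = {"鍾意": "喜欢", "驚": "害怕"}
ENGLISH_TO_MANDARIN = {"happy": "高兴"}  # handles code-switched English tokens
MANDARIN_EMOTION_LEXICON = {"喜欢": "love", "害怕": "anxiety", "高兴": "joy"}

def detect_emotion(tokens):
    """Map each token to Mandarin, look up its emotion, and take a majority vote.

    tokens: a pre-segmented Cantonese sentence (list of strings).
    Returns an emotion label, or None if no token is in the lexicon.
    """
    votes = Counter()
    for token in tokens:
        # Try the Cantonese mapping first, then the English one, else keep the token.
        mandarin = CANTONESE_TO_MANDARIN.get(
            token, ENGLISH_TO_MANDARIN.get(token.lower(), token)
        )
        emotion = MANDARIN_EMOTION_LEXICON.get(mandarin)
        if emotion is not None:
            votes[emotion] += 1
    return votes.most_common(1)[0][0] if votes else None

label = detect_emotion(["我", "好", "鍾意", "你"])
mixed = detect_emotion(["我", "好", "Happy"])
```

Tokens that appear in neither mapping pass through unchanged, so words shared by Cantonese and Mandarin can still hit the Mandarin lexicon directly.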

artificial intelligence, cantonese, natural language, (17 more...)

AAAI Conferences

The Thirty-Second International Flairs Conference

Country:

Asia > China > Hong Kong (0.06)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.95)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.92)

Synchronous Bidirectional Neural Machine Translation

Zhou, Long, Zhang, Jiajun, Zong, Chengqing

arXiv.org Artificial Intelligence | May-12-2019

Existing approaches to neural machine translation (NMT) generate the target language sequence token by token from left to right. However, this kind of unidirectional decoding framework cannot make full use of the target-side future contexts which can be produced in a right-to-left decoding direction, and thus suffers from the issue of unbalanced outputs. In this paper, we introduce a synchronous bidirectional neural machine translation (SB-NMT) that predicts its outputs using left-to-right and right-to-left decoding simultaneously and interactively, in order to leverage both of the history and future information at the same time. Specifically, we first propose a new algorithm that enables synchronous bidirectional decoding in a single model. Then, we present an interactive decoding model in which left-to-right (right-to-left) generation does not only depend on its previously generated outputs, but also relies on future contexts predicted by right-to-left (left-to-right) decoding. We extensively evaluate the proposed SB-NMT model on large-scale NIST Chinese-English, WMT14 English-German, and WMT18 Russian-English translation tasks. Experimental results demonstrate that our model achieves significant improvements over the strong Transformer model by 3.92, 1.49 and 1.04 BLEU points respectively, and obtains the state-of-the-art performance on Chinese-English and English-German translation tasks.

artificial intelligence, natural language, translation, (18 more...)

arXiv.org Artificial Intelligence

1905.04847

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.87)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
