AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation

Grégoire, Francis, Langlais, Philippe

arXiv.org Machine LearningJun-13-2018

Parallel sentence extraction is a task addressing the data sparsity problem found in multilingual natural language processing applications. We propose a bidirectional recurrent neural network based approach to extract parallel sentences from collections of multilingual texts. Our experiments with noisy parallel corpora show that we can achieve promising results against a competitive baseline by removing the need of specific feature engineering or additional external resources. To justify the utility of our approach, we extract sentence pairs from Wikipedia articles to train machine translation systems and show significant improvements in translation performance.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

1806.05559

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.04)
North America > Canada > Quebec > Montreal (0.04)
(7 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generative Neural Machine Translation

Shah, Harshil, Barber, David

arXiv.org Machine LearningJun-13-2018

We introduce Generative Neural Machine Translation (GNMT), a latent variable architecture which is designed to model the semantics of the source and target sentences. We modify an encoder-decoder translation model by adding a latent variable as a language agnostic representation which is encouraged to learn the meaning of the sentence. GNMT achieves competitive BLEU scores on pure translation tasks, and is superior when there are missing words in the source sentence. We augment the model to facilitate multilingual translation and semi-supervised learning without adding parameters. This framework significantly reduces overfitting when there is limited paired data available, and is effective for translating between pairs of languages not seen during training.

machine learning, natural language, source sentence, (19 more...)

arXiv.org Machine Learning

1806.05138

Country:

Asia > China > Beijing > Beijing (0.05)
Europe > Netherlands (0.04)
Asia > Uzbekistan (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Add feedback

Google improves Translate with offline AI

EngadgetJun-12-2018, 18:35:29 GMT

Google isn't going to sit idly by while Microsoft brings AI-based offline translation to your phone. The company is rolling out internet-free neural machine translation to its Translate apps for Android and iOS, promising much more accurate language conversion when you don't have the luxury of data. The initial release covers 58 languages, including a slew of European and Indian languages as well as common translation targets like Arabic, Chinese and Japanese. Despite the improved accuracy, the app shouldn't chew up too much of your valuable device space. Each language takes about 30MB to 40MB, Google said.

machine translation, natural language, offline ai, (1 more...)

Engadget

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Resource-Efficient Neural Architect

Zhou, Yanqi, Ebrahimi, Siavash, Arık, Sercan Ö., Yu, Haonan, Liu, Hairong, Diamos, Greg

arXiv.org Artificial IntelligenceJun-12-2018

Neural Architecture Search (NAS) is a laborious process. Prior work on automated NAS targets mainly on improving accuracy, but lacks consideration of computational resource use. We propose the Resource-Efficient Neural Architect (RENA), an efficient resource-constrained NAS using reinforcement learning with network embedding. RENA uses a policy network to process the network embeddings to generate new configurations. We demonstrate RENA on image recognition and keyword spotting (KWS) problems. RENA can find novel architectures that achieve high performance even with tight resource constraints. For CIFAR10, it achieves 2.95% test error when compute intensity is greater than 100 FLOPs/byte, and 3.87% test error when model size is less than 3M parameters. For Google Speech Commands Dataset, RENA achieves the state-of-the-art accuracy without resource constraints, and it outperforms the optimized architectures with tight resource constraints.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1806.07912

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.52)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)

Add feedback

Navigating with Graph Representations for Fast and Scalable Decoding of Neural Language Models

Zhang, Minjia, Liu, Xiaodong, Wang, Wenhan, Gao, Jianfeng, He, Yuxiong

arXiv.org Artificial IntelligenceJun-11-2018

Neural language models (NLMs) have recently gained a renewed interest by achieving state-of-the-art performance across many natural language processing (NLP) tasks. However, NLMs are very computationally demanding largely due to the computational cost of the softmax layer over a large vocabulary. We observe that, in decoding of many NLP tasks, only the probabilities of the top-K hypotheses need to be calculated preciously and K is often much smaller than the vocabulary size. This paper proposes a novel softmax layer approximation algorithm, called Fast Graph Decoder (FGD), which quickly identifies, for a given context, a set of K words that are most likely to occur according to a NLM. We demonstrate that FGD reduces the decoding time by an order of magnitude while attaining close to the full softmax baseline accuracy on neural machine translation and language modeling tasks. We also prove the theoretical guarantee on the softmax approximation quality.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1806.04189

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.77)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

iParaphrasing: Extracting Visually Grounded Paraphrases via an Image

Chu, Chenhui, Otani, Mayu, Nakashima, Yuta

arXiv.org Artificial IntelligenceJun-11-2018

A paraphrase is a restatement of the meaning of a text in other words. Paraphrases have been studied to enhance the performance of many natural language processing tasks. In this paper, we propose a novel task iParaphrasing to extract visually grounded paraphrases (VGPs), which are different phrasal expressions describing the same visual concept in an image. These extracted VGPs have the potential to improve language and image multimodal tasks such as visual question answering and image captioning. How to model the similarity between VGPs is the key of iParaphrasing. We apply various existing methods as well as propose a novel neural network-based method with image attention, and report the results of the first attempt toward iParaphrasing.

computational linguistic, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1806.04284

Country:

Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(17 more...)

Genre:

Research Report (0.50)
Overview (0.47)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)

Add feedback

Deconvolution-Based Global Decoding for Neural Machine Translation

Lin, Junyang, Sun, Xu, Ren, Xuancheng, Ma, Shuming, Su, Jinsong, Su, Qi

arXiv.org Artificial IntelligenceJun-10-2018

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have proved that language is not linear word sequence but sequence of complex structure, translation at each step should be conditioned on the whole target-side context. To tackle the problem, we propose a new NMT model that decodes the sequence with the guidance of its structural prediction of the context of the target sequence. Our model generates translation based on the structural prediction of the target-side context so that the translation can be freed from the bind of sequential order. Experimental results demonstrate that our model is more competitive compared with the state-of-the-art methods, and the analysis reflects that our model is also robust to translating sentences of different lengths and it also reduces repetition with the instruction from the target-side context for decoding.

machine learning, natural language, translation, (19 more...)

arXiv.org Artificial Intelligence

1806.03692

Country:

Europe > North Macedonia > Skopje Statistical Region > Skopje Municipality > Skopje (0.04)
Asia > Vietnam > Da Nang > Da Nang (0.04)
Asia > China > Fujian Province > Xiamen (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks

Lake, Brenden M., Baroni, Marco

arXiv.org Artificial IntelligenceJun-6-2018

Humans can understand and produce new utterances effortlessly, thanks to their compositional skills. Once a person learns the meaning of a new verb "dax," he or she can immediately understand the meaning of "dax twice" or "sing and dax." In this paper, we introduce the SCAN domain, consisting of a set of simple compositional navigation commands paired with the corresponding action sequences. We then test the zero-shot generalization capabilities of a variety of recurrent neural networks (RNNs) trained on SCAN with sequence-to-sequence methods. We find that RNNs can make successful zero-shot generalizations when the differences between training and test commands are small, so that they can apply "mix-and-match" strategies to solve the task. However, when generalization requires systematic compositional skills (as in the "dax" example above), RNNs fail spectacularly. We conclude with a proof-of-concept experiment in neural machine translation, suggesting that lack of systematicity might be partially responsible for neural networks' notorious training data thirst.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1711.0035

Country:

North America > United States (0.46)
North America > Canada (0.29)
Europe > Netherlands (0.28)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

So you think you're a flat-earther? You have to face Google Translate's sarcasm

#artificialintelligenceJun-5-2018, 09:57:10 GMT

These are people (psst, conspiracy theorists) who believe that the earth is flat. Even after years of research and all the evidence to the contrary, they still believe that if we walk far enough we will fall off the edge. Do you find that crazy? So when someone tried to translate the line "I am a flat-earther" to French, Translate wrote, "Je suis un fou" which literally means "I am a crazy person" in English. Of course, the spokesperson apologised profusely when this was brought to his notice, calling it a "glitch" that will be "taken care of immediately."

artificial intelligence, google translate, natural language, (7 more...)

#artificialintelligence

Country:

Europe > Russia (0.08)
Asia > Russia (0.08)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.56)

Add feedback

[1804.07878] Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation

#artificialintelligenceJun-3-2018, 20:22:03 GMT

Which authors of this paper are endorsers? Disable MathJax (What is MathJax?)

artificial intelligence, massively parallel cross-lingual learning, natural language, (4 more...)

#artificialintelligence

Genre: Research Report (0.86)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.56)

Add feedback