AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

r/MachineLearning - [D] Translating text from Portuguese to English - unexpected funny result

#artificialintelligenceApr-17-2019, 10:48:37 GMT

While following the guide on How to Develop a Neural Machine Translation System from Scratch, I'm trying to create a translation system for Portuguese-English.

artificial intelligence, natural language, unexpected funny result, (4 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Google has opened its first Africa Artificial Intelligence lab in Ghana

#artificialintelligenceApr-16-2019, 14:03:57 GMT

In seconds she gets a diagnosis of the disease affecting her plant and how best to manage it to boost her production. The farmer used an app on her phone based on TensorFlow, Google's Artificial Intelligence (AI) machine that the company opensourced to help developers create solutions to real-world problems. When people think of Artificial Intelligence, they most likely think of scenes from science fiction movies, but in reality, it applies to everyday life from virtual assistants to language translation on Google, says John Quinn, an AI researcher. Google now wants to position itself as an "AI first" company and with research centers across the globe in places such as Tokyo, Zurich, New York, and Paris. And last week, the technology company opened its first center in Africa in Ghana's capital city, Accra.

artificial intelligence, machine translation, natural language, (14 more...)

#artificialintelligence

Country:

Africa > Ghana > Greater Accra > Accra (0.63)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.25)

Industry: Information Technology > Services (0.53)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.36)

Add feedback

Corpora Generation for Grammatical Error Correction

Lichtarge, Jared, Alberti, Chris, Kumar, Shankar, Shazeer, Noam, Parmar, Niki, Tong, Simon

arXiv.org Machine LearningApr-10-2019

Grammatical Error Correction (GEC) has been recently modeled using the sequence-to-sequence framework. However, unlike sequence transduction problems such as machine translation, GEC suffers from the lack of plentiful parallel data. We describe two approaches for generating large parallel datasets for GEC using publicly available Wikipedia data. The first method extracts source-target pairs from Wikipedia edit histories with minimal filtration heuristics, while the second method introduces noise into Wikipedia sentences via round-trip translation through bridge languages. Both strategies yield similar sized parallel corpora containing around 4B tokens. We employ an iterative decoding strategy that is tailored to the loosely supervised nature of our constructed corpora. We demonstrate that neural GEC models trained using either type of corpora give similar performance. Fine-tuning these models on the Lang-8 corpus and ensembling allows us to surpass the state of the art on both the CoNLL-2014 benchmark and the JFLEG task. We provide systematic analysis that compares the two approaches to data generation and highlights the effectiveness of ensembling.

artificial intelligence, natural language, translation, (18 more...)

arXiv.org Machine Learning

1904.0578

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Membership Inference Attacks on Sequence-to-Sequence Models

Hisamoto, Sorami, Post, Matt, Duh, Kevin

arXiv.org Machine LearningApr-10-2019

Data privacy is an important issue for "machine learning as a service" providers. We focus on the problem of membership inference attacks: given a data sample and black-box access to a model's API, determine whether the sample existed in the model's training data. Our contribution is an investigation of this problem in the context of sequence-to-sequence models, which are important in applications such as machine translation and video captioning. We define the membership inference problem for sequence generation, provide an open dataset based on state-of-the-art machine translation models, and report initial results on whether these models leak private information against several kinds of membership inference attacks.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1904.05506

Country:

Europe (1.00)
North America > United States (0.68)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Consistency by Agreement in Zero-shot Neural Machine Translation

Al-Shedivat, Maruan, Parikh, Ankur P.

arXiv.org Machine LearningApr-10-2019

Generalization and reliability of multilingual translation often highly depend on the amount of available parallel data for each language pair of interest. In this paper, we focus on zero-shot generalization---a challenging setup that tests models on translation directions they have not been optimized for at training time. To solve the problem, we (i) reformulate multilingual translation as probabilistic inference, (ii) define the notion of zero-shot consistency and show why standard training often results in models unsuitable for zero-shot tasks, and (iii) introduce a consistent agreement-based training method that encourages the model to produce equivalent translations of parallel sentences in auxiliary languages. We test our multilingual NMT models on multiple public zero-shot translation benchmarks (IWSLT17, UN corpus, Europarl) and show that agreement-based learning often results in 2-3 BLEU zero-shot improvement over strong baselines without any loss in performance on supervised translation directions.

large language model, machine learning, translation, (20 more...)

arXiv.org Machine Learning

1904.02338

Country:

North America > United States (0.28)
Asia > Japan > Honshū (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Modeling Recurrence for Transformer

Hao, Jie, Wang, Xing, Yang, Baosong, Wang, Longyue, Zhang, Jinfeng, Tu, Zhaopeng

arXiv.org Artificial IntelligenceApr-5-2019

Recently, the Transformer model that is based solely on attention mechanisms, has advanced the state-of-the-art on various machine translation tasks. However, recent studies reveal that the lack of recurrence hinders its further improvement of translation capacity. In response to this problem, we propose to directly model recurrence for Transformer with an additional recurrence encoder. In addition to the standard recurrent neural network, we introduce a novel attentive recurrent network to leverage the strengths of both attention and recurrent networks. Experimental results on the widely-used WMT14 English-German and WMT17 Chinese-English translation tasks demonstrate the effectiveness of the proposed approach. Our studies also reveal that the proposed model benefits from a short-cut that bridges the source and target sequences with a single recurrent layer, which outperforms its deep counterpart.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1904.03092

Country: Asia > Macao (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

Information Aggregation for Multi-Head Attention with Routing-by-Agreement

Li, Jian, Yang, Baosong, Dou, Zi-Yi, Wang, Xing, Lyu, Michael R., Tu, Zhaopeng

arXiv.org Artificial IntelligenceApr-5-2019

Multi-head attention is appealing for its ability to jointly extract different types of information from multiple representation subspaces. Concerning the information aggregation, a common practice is to use a concatenation followed by a linear transformation, which may not fully exploit the expressiveness of multi-head attention. In this work, we propose to improve the information aggregation for multi-head attention with a more powerful routing-by-agreement algorithm. Specifically, the routing algorithm iteratively updates the proportion of how much a part (i.e. the distinct information learned from a specific subspace) should be assigned to a whole (i.e. the final output representation), based on the agreement between parts and wholes. Experimental results on linguistic probing tasks and machine translation tasks prove the superiority of the advanced information aggregation over the standard linear transformation.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

1904.031

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Macao (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

Convolutional Self-Attention Networks

Yang, Baosong, Wang, Longyue, Wong, Derek, Chao, Lidia S., Tu, Zhaopeng

arXiv.org Artificial IntelligenceApr-5-2019

Self-attention networks (SANs) have drawn increasing interest due to their high parallelization in computation and flexibility in modeling dependencies. SANs can be further enhanced with multi-head attention by allowing the model to attend to information from different representation subspaces. In this work, we propose novel convolutional self-attention networks, which offer SANs the abilities to 1) strengthen dependencies among neighboring elements, and 2) model the interaction between features extracted by multiple attention heads. Experimental results of machine translation on different language pairs and model settings show that our approach outperforms both the strong Transformer baseline and other existing models on enhancing the locality of SANs. Comparing with prior studies, the proposed model is parameter free in terms of introducing no more parameters.

information, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

1904.03107

Country: Asia > Macao (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Mohiuddin, Tasnim, Joty, Shafiq

arXiv.org Machine LearningApr-4-2019

Adversarial training has shown impressive success in learning bilingual dictionary without any parallel data by mapping monolingual embeddings to a shared space. However, recent work has shown superior performance for non-adversarial methods in more challenging language pairs. In this work, we revisit adversarial autoencoder for unsupervised word translation and propose two novel extensions to it that yield more stable training and improved results. Our method includes regularization terms to enforce cycle consistency and input reconstruction, and puts the target encoders as an adversary against the corresponding discriminator. Extensive experimentations with European, non-European and low-resource languages show that our method is more robust and achieves better performance than recently proposed adversarial and non-adversarial approaches.

machine learning, mapping, natural language, (16 more...)

arXiv.org Machine Learning

1904.04116

Country: North America (0.46)

Genre:

Research Report (0.64)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation

Xu, Weijia, Niu, Xing, Carpuat, Marine

arXiv.org Machine LearningApr-4-2019

Despite some empirical success at correcting exposure bias in machine translation, scheduled sampling algorithms suffer from a major drawback: they incorrectly assume that words in the reference translations and in sampled sequences are aligned at each time step. Our new differentiable sampling algorithm addresses this issue by optimizing the probability that the reference can be aligned with the sampled output, based on a soft alignment predicted by the model itself. As a result, the output distribution at each time step is evaluated with respect to the whole predicted sequence. Experiments on IWSLT translation tasks show that our approach improves BLEU compared to maximum likelihood and scheduled sampling baselines. In addition, our approach is simpler to train with no need for sampling schedule and yields models that achieve larger improvements with smaller beam sizes.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1904.04079

Country: North America > United States (0.29)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.36)

Add feedback