AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Twitter round-up: Google's neural machine translation system most popular AI tweet in August 2020

#artificialintelligenceSep-22-2020, 00:40:10 GMT

Verdict lists ten of the most popular tweets on artificial intelligence (AI) in August 2020 based on data from GlobalData's Influencer Platform. The top tweets were chosen from influencers as tracked by GlobalData's Influencer Platform, which is based on a scientific process that works on pre-defined parameters. Influencers are selected after a deep analysis of the influencer's relevance, network strength, engagement, and leading discussions on new and emerging trends. Ronald van Loon, principal analyst and CEO of Intelligent World, shared a video from the World Economic Forum on a neural machine translation technology developed by Google to provide natural translation between different languages using artificial intelligence and deep learning. The system was also used to translate two languages without using English as a bridge.

machine learning, natural language, twitter handle, (16 more...)

#artificialintelligence

Country:

Africa > Uganda > Central Region > Kampala (0.06)
North America > United States > New York > Suffolk County > Stony Brook (0.05)
Asia > Japan (0.05)

Industry: Banking & Finance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation

Zhu, Yuanda, Sha, Ying, Wu, Hang, Li, Mai, Hoffman, Ryan A., Wang, May D.

arXiv.org Machine LearningSep-22-2020

Each year there are nearly 57 million deaths around the world, with over 2.7 million in the United States. Timely, accurate and complete death reporting is critical in public health, as institutions and government agencies rely on death reports to analyze vital statistics and to formulate responses to communicable diseases. Inaccurate death reporting may result in potential misdirection of public health policies. Determining the causes of death is, nevertheless, challenging even for experienced physicians. To facilitate physicians in accurately reporting causes of death, we present an advanced AI approach to determine a chronically ordered sequence of clinical conditions that lead to death, based on decedent's last hospital admission discharge record. The sequence of clinical codes on the death report is named as causal chain of death, coded in the tenth revision of International Statistical Classification of Diseases (ICD-10); the priority-ordered clinical conditions on the discharge record are coded in ICD-9. We identify three challenges in proposing the causal chain of death: two versions of coding system in clinical codes, medical domain knowledge conflict, and data interoperability. To overcome the first challenge in this sequence-to-sequence problem, we apply neural machine translation models to generate target sequence. We evaluate the quality of generated sequences with the BLEU (BiLingual Evaluation Understudy) score and achieve 16.44 out of 100. To address the second challenge, we incorporate expert-verified medical domain knowledge as constraint in generating output sequence to exclude infeasible causal chains. Lastly, we demonstrate the usability of our work in a Fast Healthcare Interoperability Resources (FHIR) interface to address the third challenge.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

2009.10318

Country:

North America > United States > Michigan (0.05)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > China > Anhui Province > Hefei (0.04)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Public Health (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.68)
Government > Regional Government > North America Government > United States Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Rise of the Transformers

#artificialintelligenceSep-21-2020, 14:20:21 GMT

Rise of the Transformers with Self-Attention Mechanism The intention of this article is to continue in answering the questions that my friends April Rudin, Tripp Braden, Danielle Guzman and Richard Foster-Fletcher asked about the future of AI. Furthermore Irene Iyakovet interview with me about how

large language model, machine learning, natural language, (20 more...)

#artificialintelligence

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Texas (0.04)

Genre: Research Report (0.47)

Industry:

Leisure & Entertainment (0.47)
Information Technology (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)

Add feedback

Generative Imagination Elevates Machine Translation

Long, Quanyu, Wang, Mingxuan, Li, Lei

arXiv.org Artificial IntelligenceSep-21-2020

There are thousands of languages on earth, but visual perception is shared among peoples. Existing multimodal neural machine translation (MNMT) methods achieve knowledge transfer by enforcing one encoder to learn shared representation across textual and visual modalities. However, the training and inference process heavily relies on well-aligned bilingual sentence - image triplets as input, which are often limited in quantity. In this paper, we hypothesize that visual imagination via synthesizing visual representation from source text could help the neural model map two languages with different symbols, thus helps the translation task. Our proposed end-to-end imagination-based machine translation model (ImagiT) first learns to generate semantic-consistent visual representation from source sentence, and then generate target sentence based on both text representation and imagined visual representation. Experiments demonstrate that our translation model benefits from visual imagination and significantly outperforms the text-only neural machine translation (NMT) baseline. We also conduct analyzing experiments, and the results show that imagination can help fill in missing information when performing the degradation strategy.

artificial intelligence, natural language, representation, (15 more...)

arXiv.org Artificial Intelligence

2009.09654

Country: Asia > China (0.04)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Target Conditioning for One-to-Many Generation

Lachaux, Marie-Anne, Joulin, Armand, Lample, Guillaume

arXiv.org Machine LearningSep-21-2020

Neural Machine Translation (NMT) models often lack diversity in their generated translations, even when paired with search algorithm, like beam search. A challenge is that the diversity in translations are caused by the variability in the target language, and cannot be inferred from the source sentence alone. In this paper, we propose to explicitly model this one-to-many mapping by conditioning the decoder of a NMT model on a latent variable that represents the domain of target sentences. The domain is a discrete variable generated by a target encoder that is jointly trained with the NMT model. The predicted domain of target sentences are given as input to the decoder during training. At inference, we can generate diverse translations by decoding with different domains. Unlike our strongest baseline (Shen et al., 2019), our method can scale to any number of domains without affecting the performance or the training time. We assess the quality and diversity of translations generated by our model with several metrics, on three different datasets.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

2009.09758

Country:

Asia > Middle East > Republic of Türkiye (0.29)
Asia > Middle East > Syria (0.06)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)
Asia > Russia (0.05)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Softmax Tempering for Training Neural Machine Translation Models

Dabre, Raj, Fujita, Atsushi

arXiv.org Artificial IntelligenceSep-20-2020

Neural machine translation (NMT) models are typically trained using a softmax cross-entropy loss where the softmax distribution is compared against smoothed gold labels. In low-resource scenarios, NMT models tend to over-fit because the softmax distribution quickly approaches the gold label distribution. To address this issue, we propose to divide the logits by a temperature coefficient, prior to applying softmax, during training. In our experiments on 11 language pairs in the Asian Language Treebank dataset and the WMT 2019 English-to-German translation task, we observed significant improvements in translation quality by up to 3.9 BLEU points. Furthermore, softmax tempering makes the greedy search to be as good as beam search decoding in terms of translation quality, enabling 1.5 to 3.5 times speed-up. We also study the impact of softmax tempering on multilingual NMT and recurrently stacked NMT, both of which aim to reduce the NMT model size by parameter sharing thereby verifying the utility of temperature in developing compact NMT models. Finally, an analysis of softmax entropies and gradients reveal the impact of our method on the internal behavior of NMT models.

machine learning, natural language, softmax, (18 more...)

arXiv.org Artificial Intelligence

2009.09372

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Germany > Berlin (0.04)
(11 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Computer Assisted Translation with Neural Quality Estimation and Automatic Post-Editing

Wang, Ke, Wang, Jiayi, Ge, Niyu, Shi, Yangbing, Zhao, Yu, Fan, Kai

arXiv.org Artificial IntelligenceSep-18-2020

With the advent of neural machine translation, there has been a marked shift towards leveraging and consuming the machine translation results. However, the gap between machine translation systems and human translators needs to be manually closed by post-editing. In this paper, we propose an end-to-end deep learning framework of the quality estimation and automatic post-editing of the machine translation output. Our goal is to provide error correction suggestions and to further relieve the burden of human translators through an interpretable model. To imitate the behavior of human translators, we design three efficient delegation modules -- quality estimation, generative post-editing, and atomic operation post-editing and construct a hierarchical model based on them. We examine this approach with the English--German dataset from WMT 2017 APE shared task and our experimental results can achieve the state-of-the-art performance. We also verify that the certified translators can significantly expedite their post-editing processing with our model in human evaluation.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2009.09126

Country:

Europe > Italy > Tuscany > Florence (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation

Zhang, Pei, Chen, Boxing, Ge, Niyu, Fan, Kai

arXiv.org Artificial IntelligenceSep-18-2020

Many document-level neural machine translation (NMT) systems have explored the utility of context-aware architecture, usually requiring an increasing number of parameters and computational complexity. However, few attention is paid to the baseline model. In this paper, we research extensively the pros and cons of the standard transformer in document-level translation, and find that the auto-regressive property can simultaneously bring both the advantage of the consistency and the disadvantage of error accumulation. Therefore, we propose a surprisingly simple long-short term masking self-attention on top of the standard transformer to both effectively capture the long-range dependence and reduce the propagation of errors. We examine our approach on the two publicly available document-level datasets. We can achieve a strong result in BLEU and capture discourse phenomena.

artificial intelligence, natural language, transformer, (15 more...)

arXiv.org Artificial Intelligence

2009.09127

Country: Asia > China (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

The reason AI still struggles to learn languages

#artificialintelligenceSep-9-2020, 11:52:04 GMT

Common sense would tell us that it is necessary to understand a text in order to translate it. So can an artificial intelligence system actually understand a text in the same sense a human being can? The simplest approach to translation would be simply to have a computer translate word-by-word, utilizing a digitalized bilingual dictionary, and ignoring grammatical structures. Needless to say, the results of this simplistic procedure are often incomprehensible and useless. Translating between human languages requires intelligence in some form. An ideal field for AI to flex its muscles! One of the biggest dilemmas for machine translation (MT) lies in the fact that human language is full of ambiguities. The meanings of words or even entire sentences – and hence also their translations – cannot be determined in isolation, but only in context. The latter can include not only other words and sentences in the text, but also knowledge about the subject matter of the text.

learn language, machine translation, natural language, (1 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Data Weighted Training Strategies for Grammatical Error Correction

Lichtarge, Jared, Alberti, Chris, Kumar, Shankar

arXiv.org Machine LearningSep-9-2020

Recent progress in the task of Grammatical Error Correction (GEC) has been driven by addressing data sparsity, both through new methods for generating large and noisy pretraining data and through the publication of small and higher-quality finetuning data in the BEA-2019 shared task. Building upon recent work in Neural Machine Translation (NMT), we make use of both kinds of data by deriving example-level scores on our large pretraining data based on a smaller, higher-quality dataset. In this work, we perform an empirical study to discover how to best incorporate delta-log-perplexity, a type of example scoring, into a training schedule for GEC. In doing so, we perform experiments that shed light on the function and applicability of delta-log-perplexity. Models trained on scored data achieve state-of-the-art results on common GEC test sets.

computational linguistic, dataset, proceedings, (13 more...)

arXiv.org Machine Learning

2008.02976

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Tuscany > Florence (0.04)
North America > United States > Maryland > Baltimore (0.04)
(15 more...)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback