AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

What Are The Risks And Benefits Of Artificial Intelligence?

#artificialintelligenceSep-19-2019, 22:46:41 GMT

What are the risks and benefits of artificial intelligence? It's a complicated topic, but I'll try to unpack a few key points here. Let's start with a quick definition: AI is the simulation of human intelligence by machines. Example of AI systems used regularly in developed countries include Amazon's Alexa, smart replies in Gmail, Chatbots, predictive searches in Google, and recommendations. At a baseline level, AI helps improve our everyday lives by solving pain points, streamlining processes, and advancing human knowledge.

artificial intelligence, deep learning, risk and benefit, (10 more...)

#artificialintelligence

Country: North America > United States > California (0.05)

Industry:

Information Technology (0.99)
Food & Agriculture > Agriculture (0.72)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.75)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.55)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.49)

Add feedback

What Are The Risks And Benefits Of Artificial Intelligence?

#artificialintelligenceSep-19-2019, 22:46:41 GMT

artificial intelligence, deep learning, risk and benefit, (10 more...)

#artificialintelligence

Country: North America > United States > California (0.05)

Industry:

Information Technology (0.99)
Food & Agriculture > Agriculture (0.72)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.75)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.55)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.49)

Add feedback

Machine Learning for Clinical Predictive Analytics

Weng, Wei-Hung

arXiv.org Machine LearningSep-19-2019

In this chapter, we provide a brief overview of applying machine learning techniques for clinical prediction tasks. We begin with a quick introduction to the concepts of machine learning and outline some of the most common machine learning algorithms. Next, we demonstrate how to apply the algorithms with appropriate toolkits to conduct machine learning experiments for clinical prediction tasks. The objectives of this chapter are to (1) understand the basics of machine learning techniques and the reasons behind why they are useful for solving clinical prediction problems, (2) understand the intuition behind some machine learning models, including regression, decision trees, and support vector machines, and (3) understand how to apply these models to clinical prediction problems using publicly available datasets via case studies.

algorithm, learning, machine learning, (12 more...)

arXiv.org Machine Learning

1909.09246

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Wisconsin (0.04)

Genre:

Research Report > Experimental Study (0.47)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (0.68)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
(2 more...)

Add feedback

Global Autoregressive Models for Data-Efficient Sequence Learning

Parshakova, Tetiana, Andreoli, Jean-Marc, Dymetman, Marc

arXiv.org Artificial IntelligenceSep-19-2019

Standard autoregressive seq2seq models are easily trained by max-likelihood, but tend to show poor results under small-data conditions. We introduce a class of seq2seq models, GAMs (Global Autoregressive Models), which combine an autoregressive component with a log-linear component, allowing the use of global \textit{a priori} features to compensate for lack of data. We train these models in two steps. In the first step, we obtain an \emph{unnormalized} GAM that maximizes the likelihood of the data, but is improper for fast inference or evaluation. In the second step, we use this GAM to train (by distillation) a second autoregressive model that approximates the \emph{normalized} distribution associated with the GAM, and can be used for fast inference and evaluation. Our experiments focus on language modelling under synthetic conditions and show a strong perplexity reduction of using the second autoregressive model over the standard one.

experiment, motif, sequence, (15 more...)

arXiv.org Artificial Intelligence

1909.07063

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Fine-Tuning Language Models from Human Preferences

Ziegler, Daniel M., Stiennon, Nisan, Wu, Jeffrey, Brown, Tom B., Radford, Alec, Amodei, Dario, Christiano, Paul, Irving, Geoffrey

arXiv.org Machine LearningSep-18-2019

Reward learning enables the application of reinforcement learning (RL) to tasks where reward is defined by human judgment, building a model of reward by asking humans questions. Most work on reward learning has used simulated environments, but complex information about values is often expressed in natural language, and we believe reward learning for language is a key to making RL practical and safe for real-world tasks. In this paper, we build on advances in generative pretraining of language models to apply reward learning to four natural language tasks: continuing text with positive sentiment or physically descriptive language, and summarization tasks on the TL;DR and CNN/Daily Mail datasets. For stylistic continuation we achieve good results with only 5,000 comparisons evaluated by humans. For summarization, models trained with 60,000 comparisons copy whole sentences from the input but skip irrelevant preamble; this leads to reasonable ROUGE scores and very good performance according to our human labelers, but may be exploiting the fact that labelers rely on simple heuristics.

arxiv preprint arxiv, machine learning, reinforcement learning, (18 more...)

arXiv.org Machine Learning

1909.08593

Country:

Asia > China (0.28)
Asia > Middle East (0.28)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.68)
Government (0.46)
Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.88)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Memory-Augmented Neural Networks for Machine Translation

Collier, Mark, Beel, Joeran

arXiv.org Machine LearningSep-18-2019

Memory-augmented neural networks (MANNs) have been shown to outperform other recurrent neural network architectures on a series of artificial sequence learning tasks, yet they have had limited application to real-world tasks. We evaluate direct application of Neural Turing Machines (NTM) and Differentiable Neural Computers (DNC) to machine translation. We further propose and evaluate two models which extend the attentional encoder-decoder with capabilities inspired by memory augmented neural networks. We evaluate our proposed models on IWSLT Vietnamese to English and ACL Romanian to English datasets. Our proposed models and the memory augmented neural networks perform similarly to the attentional encoder-decoder on the Vietnamese to English translation task while have a 0.3-1.9 lower BLEU score for the Romanian to English task. Interestingly, our analysis shows that despite being equipped with additional flexibility and being randomly initialized memory augmented neural networks learn an algorithm for machine translation almost identical to the attentional encoder-decoder.

attentional encoder-decoder, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

1909.08314

Country: Europe > Ireland > Leinster > County Dublin > Dublin (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

r/MachineLearning - [Project] Multilingual Neural Machine Translation using Transformers with Conditional Normalization.

#artificialintelligenceSep-14-2019, 13:43:56 GMT

The goal here is similar, make the rest of the network learn a common representation, while making the normalization parameters learn language specific semantics. The One-to-Many and Many-to-One models are trained for English to French, German, Italian and Spanish Translation and Vice Versa. The Many to Many model is trained on English-French, French-English, English-German and German-English. The image stylization paper specifies how a N-style network can pick up an N 1th style through fine-tuning an existing model. Similarly, I fine-tune my Many-to-Many model to pick up Portuguese.

artificial intelligence, demo, natural language, (10 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Hint-Based Training for Non-Autoregressive Machine Translation

Li, Zhuohan, Lin, Zi, He, Di, Tian, Fei, Qin, Tao, Wang, Liwei, Liu, Tie-Yan

arXiv.org Machine LearningSep-14-2019

Due to the unparallelizable nature of the autoregressive factorization, AutoRegressive Translation (ART) models have to generate tokens sequentially during decoding and thus suffer from high inference latency. Non-AutoRegressive Translation (NART) models were proposed to reduce the inference time, but could only achieve inferior translation accuracy. In this paper, we proposed a novel approach to leveraging the hints from hidden states and word alignments to help the training of NART models. The results achieve significant improvement over previous NART models for the WMT14 En-De and De-En datasets and are even comparable to a strong LSTM-based ART baseline but one order of magnitude faster in inference.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

1909.06708

Genre: Research Report > Promising Solution (0.34)

Industry: Education (0.95)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

Add feedback

Adaptive Scheduling for Multi-Task Learning

Jean, Sébastien, Firat, Orhan, Johnson, Melvin

arXiv.org Machine LearningSep-13-2019

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we explore different task scheduling approaches. We first consider existing non-adaptive techniques, then move on to adaptive schedules that over-sample tasks with poorer results compared to their respective baseline. As explicit schedules can be inefficient, especially if one task is highly over-sampled, we also consider implicit schedules, learning to scale learning rates or gradients of individual tasks instead. These techniques allow training multilingual models that perform better for low-resource language pairs (tasks with small amount of data), while minimizing negative effects on high-resource tasks.

arxiv preprint arxiv, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1909.06434

Country: North America > Canada (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Entity Projection via Machine Translation for Cross-Lingual NER

Jain, Alankar, Paranjape, Bhargavi, Lipton, Zachary C.

arXiv.org Artificial IntelligenceSep-13-2019

Although over 100 languages are supported by strong off-the-shelf machine translation systems, only a subset of them possess large annotated corpora for named entity recognition. Motivated by this fact, we leverage machine translation to improve annotation-projection approaches to cross-lingual named entity recognition. We propose a system that improves over prior entity-projection methods by: (a) leveraging machine translation systems twice: first for translating sentences and subsequently for translating entities; (b) matching entities based on orthographic and phonetic similarity; and (c) identifying matches based on distributional statistics derived from the dataset. Our approach improves upon current state-of-the-art methods for cross-lingual named entity recognition on 5 diverse languages by an average of 4.1 points. Further, our method achieves state-of-the-art F_1 scores for Armenian, outperforming even a monolingual model trained on Armenian source data.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1909.05356

Country:

North America > United States (0.28)
Europe (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback