AITopics | Machine Translation

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

Artificial Intelligence Translation: Who benefits? | Wolfestone Translation

#artificialintelligence | Mar-5-2020, 09:24:27 GMT

As part of January's The Future of Translation series, we delved into our predictions on what was next for the Language Services Industry in 2020 and beyond. One of the most exciting developments that we mentioned in our series was the evolution of Artificial Intelligence Translation. But the question remains: Why should you care? What exactly does the evolution, and adoption, of AI-powered translation mean for your business? So, without further ado… Let's get into it!

ai translation, artificial intelligence translation, translation, (10 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

An Empirical Accuracy Law for Sequential Machine Translation: the Case of Google Translate

Sequeira, Lucas Nunes, Moreschi, Bruno, Cozman, Fabio Gagliardi, Fontes, Bernardo

arXiv.org Machine Learning | Mar-5-2020

We have established, through empirical testing, a law that relates the number of translating hops to translation accuracy in sequential machine translation in Google Translate. Both accuracy and size decrease with the number of hops; the former displays a decrease closely following a power law. Such a law allows one to predict the behavior of translation chains that may be built as society increasingly depends on automated devices.
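As a rough illustration of what a power-law relation between hops and accuracy looks like in practice, the sketch below fits accuracy = a * hops^(-b) by least squares in log-log space. The function name `fit_power_law` and the data points are hypothetical: the values are generated from a known curve purely to exercise the fit and are not results from the paper.

```python
import math

def fit_power_law(hops, accuracy):
    """Least-squares fit of accuracy = a * hops**(-b), done in log-log space."""
    xs = [math.log(h) for h in hops]
    ys = [math.log(acc) for acc in accuracy]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # slope of the line in log-log space
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope     # (a, b)

# Synthetic, illustrative data only (assumed a = 0.9, b = 0.3); not from the paper.
hops = [1, 2, 4, 8, 16]
accuracy = [0.9 * h ** -0.3 for h in hops]
a, b = fit_power_law(hops, accuracy)
print(f"a = {a:.3f}, b = {b:.3f}")
```

A straight line in log-log space is the usual quick check for power-law behavior; with real measurements the fit would be noisy and worth comparing against alternatives such as exponential decay.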

accuracy, machine translation, translation, (11 more...)

arXiv.org Machine Learning

2003.02817

Country:

South America > Brazil > São Paulo (0.04)
South America > Brazil > Santa Catarina (0.04)
North America > Mexico (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward

Schmidt, Florian, Hofmann, Thomas

arXiv.org Machine Learning | Mar-5-2020

Measuring the quality of a generated sequence against a set of references is a central problem in many learning frameworks, be it to compute a score, to assign a reward, or to perform discrimination. Despite great advances in model architectures, metrics that scale independently of the number of references are still based on n-gram estimates. We show that the underlying operations, counting words and comparing counts, can be lifted to embedding words and comparing embeddings. An in-depth analysis of BERT embeddings shows empirically that contextual embeddings can be employed to capture the required dependencies while maintaining the necessary scalability through appropriate pruning and smoothing techniques. We cast unconditional generation as a reinforcement learning problem and show that our reward function indeed provides a more effective learning signal than n-gram reward in this challenging setting.
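The "counting words and comparing counts" operation that the abstract refers to can be sketched as a clipped n-gram precision, in the style of BLEU's modified precision. This is only the n-gram baseline, not the embedding-based reward the paper proposes, and the function names `ngrams` and `clipped_precision` are hypothetical.

```python
from collections import Counter

def ngrams(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def clipped_precision(candidate, references, n):
    """Fraction of candidate n-grams matched in the references, with each
    n-gram's credit clipped to its maximum count in any single reference."""
    cand = ngrams(candidate, n)
    if not cand:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    matched = sum(min(count, max_ref[gram]) for gram, count in cand.items())
    return matched / sum(cand.values())

candidate = "the the the the".split()
references = ["the cat is on the mat".split()]
print(clipped_precision(candidate, references, 1))  # 0.5: "the" is credited at most twice
```

Clipping is what stops a degenerate candidate from scoring well by repeating one frequent word.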

bert, computational linguistic, sequence, (15 more...)

arXiv.org Machine Learning

2003.02738

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Hong Kong (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(8 more...)

Genre: Research Report > Experimental Study (0.46)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)

The Future of Computing is Distributed

#artificialintelligence | Mar-2-2020, 06:19:42 GMT

Distributed applications are not new. The first distributed applications were developed over 50 years ago with the arrival of computer networks, such as ARPANET. Since then, developers have leveraged distributed systems to scale out applications and services, including large-scale simulations, web serving, and big data processing. In my own career, which started more than 20 years ago, I have worked on distributed systems in the context of the internet, peer-to-peer networks, big data, and now, machine learning. However, until recently, distributed applications have been the exception, rather than the norm.

application, moore, workload, (9 more...)

#artificialintelligence

Industry: Information Technology > Software (0.37)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.32)

Transformer++

Thapak, Prakhar, Hore, Prodip

arXiv.org Machine Learning | Mar-2-2020

Recent advancements in attention mechanisms have replaced recurrent neural networks and their variants for machine translation tasks. The Transformer, which relies solely on the attention mechanism, achieved state-of-the-art results in sequence modeling. Neural machine translation based on the attention mechanism is parallelizable and addresses the problem of handling long-range dependencies among words in sentences more effectively than recurrent neural networks. One of the key concepts in attention is to learn three matrices, query, key, and value, where global dependencies among words are learned by linearly projecting word embeddings through these matrices. Multiple query, key, and value matrices can be learned simultaneously, each focusing on a different subspace of the embedding dimension; this is called multi-head attention in the Transformer. We argue that certain dependencies among words could be learned better through an intermediate context than by directly modeling word-word dependencies. This could be due to the nature of certain dependencies or to a lack of patterns, which makes them difficult to model globally using multi-head self-attention. In this work, we propose a new way of learning dependencies through a context in multi-head attention using convolution. This new form of multi-head attention, along with the traditional form, achieves better results than the Transformer on the WMT 2014 English-to-German and English-to-French translation tasks. We also introduce a framework to learn POS tagging and NER information during the training of the encoder, which further improves results, achieving a new state of the art of 32.1 BLEU (1.4 BLEU better than the existing best) on the WMT 2014 English-to-German task and 44.6 BLEU (1.1 BLEU better than the existing best) on the WMT 2014 English-to-French task. We call this Transformer++.
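The query, key, and value projections described in the abstract can be sketched in a few lines. This is a minimal sketch of standard scaled dot-product attention as used in the original Transformer, not the convolution-based context variant that Transformer++ proposes. The helper names (`matmul`, `attention`, `multi_head`) and the toy matrices are assumptions for illustration, and the final output projection of a real multi-head layer is omitted.

```python
import math

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over embeddings x."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]          # transpose of K
    scores = matmul(q, k_t)                       # word-to-word similarities
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights

def multi_head(x, heads):
    """Run one attention per (w_q, w_k, w_v) triple and concatenate per token."""
    outputs = [attention(x, w_q, w_k, w_v)[0] for w_q, w_k, w_v in heads]
    return [sum((head[i] for head in outputs), []) for i in range(len(x))]

# Toy input: three "words" with 2-dimensional embeddings, identity projections.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
out, weights = attention(x, identity, identity, identity)
```

Each row of `weights` sums to 1 and says how much one word attends to every other word; in a trained model the projection matrices are learned rather than fixed.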

dependency, machine translation, translation, (17 more...)

arXiv.org Machine Learning

2003.04974

Country:

North America > United States > California > San Diego County > San Diego (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > Puerto Rico (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Towards Automatic Face-to-Face Translation

R, Prajwal K, Mukhopadhyay, Rudrabha, Philip, Jerin, Jha, Abhishek, Namboodiri, Vinay, Jawahar, C. V.

arXiv.org Artificial Intelligence | Mar-1-2020

In light of the recent breakthroughs in automatic machine translation systems, we propose a novel approach that we term "Face-to-Face Translation". As today's digital communication becomes increasingly visual, we argue that there is a need for systems that can automatically translate a video of a person speaking in language A into a target language B with realistic lip synchronization. In this work, we create an automatic pipeline for this problem and demonstrate its impact on multiple real-world applications. First, we build a working speech-to-speech translation system by bringing together multiple existing modules from speech and language. We then move towards "Face-to-Face Translation" by incorporating a novel visual module, LipGAN, for generating realistic talking faces from the translated audio. Quantitative evaluation of LipGAN on the standard LRW test set shows that it significantly outperforms existing approaches across all standard metrics. We also subject our Face-to-Face Translation pipeline to multiple human evaluations and show that it can significantly improve the overall user experience for consuming and interacting with multimodal content across languages. Code, models and demo video are made publicly available. Demo video: https://www.youtube.com/watch?v=aHG6Oei8jF0 Code and models: https://github.com/Rudrabha/LipGAN

speech, translation, video, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3343031.3351066

2003.00418

Country:

Europe > France > Provence-Alpes-Côte d'Azur > Alpes-Maritimes > Nice (0.04)
North America > United States > New York (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:

Research Report > Promising Solution (0.48)
Research Report > New Finding (0.46)
Overview > Innovation (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Uncertainty in Structured Prediction

Malinin, Andrey, Gales, Mark

arXiv.org Artificial Intelligence | Feb-28-2020

Uncertainty estimation is important for ensuring safety and robustness of AI systems, especially for high-risk applications. While much progress has recently been made in this area, most research has focused on unstructured prediction, such as image classification and regression tasks. However, while task-specific forms of confidence score estimation have been investigated by the speech and machine translation communities, limited work has investigated general uncertainty estimation approaches for structured prediction. Thus, this work aims to investigate uncertainty estimation for structured prediction tasks within a single unified and interpretable probabilistic ensemble-based framework. We consider uncertainty estimation for sequence data at the token level and the complete sequence level, provide interpretations for, and applications of, various measures of uncertainty, and discuss the challenges associated with obtaining them. This work also explores the practical challenges associated with obtaining uncertainty estimates for structured prediction tasks and provides baselines for token-level error detection, sequence-level prediction rejection, and sequence-level out-of-domain input detection using ensembles of auto-regressive transformer models trained on the WMT'14 English-French and WMT'17 English-German translation and LibriSpeech speech recognition datasets.
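For a sense of what token-level uncertainty from an ensemble can look like, the sketch below uses a common entropy-based decomposition: total uncertainty is the entropy of the averaged prediction, data uncertainty is the average entropy of the members, and their difference (a mutual information) is read as knowledge uncertainty. This decomposition is assumed here for illustration and the paper's exact measures may differ; `token_uncertainty` is a hypothetical name.

```python
import math

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def token_uncertainty(member_probs):
    """Decompose uncertainty for one token position.

    member_probs: one next-token distribution per ensemble member.
    Returns (total, data, knowledge) uncertainty.
    """
    m = len(member_probs)
    mean = [sum(col) / m for col in zip(*member_probs)]   # ensemble average
    total = entropy(mean)
    data = sum(entropy(p) for p in member_probs) / m      # expected entropy
    return total, data, total - data                      # difference = mutual information

# Members that agree give near-zero knowledge uncertainty;
# members that confidently disagree give high knowledge uncertainty.
agree = token_uncertainty([[0.5, 0.5], [0.5, 0.5]])
disagree = token_uncertainty([[1.0, 0.0], [0.0, 1.0]])
```

High knowledge uncertainty with low data uncertainty is the pattern typically associated with out-of-domain inputs, which is why disagreement between members is a useful detection signal.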

detection, ensemble, knowledge uncertainty, (15 more...)

arXiv.org Artificial Intelligence

2002.0765

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(2 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
(2 more...)

Across the Language Barrier

Communications of the ACM | Feb-25-2020, 12:33:53 GMT

Waverly Labs' Ambassador, an over-the-ear translation device, can support up to 20 languages and 42 dialects. "The greatest obstacle to international understanding is the barrier of language," wrote British scholar and author Christopher Dawson in November 1957, believing that relying on live, human translators to accurately capture and reflect a speaker's meaning, inflection, and emotion was too great a challenge to overcome. More than 60 years later, Dawson's theory may finally be proven outdated, thanks to the development of powerful, portable real-time translation devices. The convergence of natural language processing technology, machine learning algorithms, and powerful portable chipsets has led to the development of new devices and applications that allow real-time, two-way translation of speech and text. Language translation devices are capable of listening to an audio source in one language, translating what is being said into another language, and then translating a ...

pocketalk, translation, translation device, (15 more...)

Communications of the ACM

Country:

Asia > Japan (0.15)
South America > Brazil (0.04)
North America > United States > New York > Nassau County > Lynbrook (0.04)
(2 more...)

Industry:

Education > Educational Setting (0.70)
Leisure & Entertainment > Sports > Olympic Games (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Accessing Higher-level Representations in Sequential Transformers with Feedback Memory

Fan, Angela, Lavril, Thibaut, Grave, Edouard, Joulin, Armand, Sukhbaatar, Sainbayar

arXiv.org Machine Learning | Feb-21-2020

Transformers are feedforward networks that can process input tokens in parallel. While this parallelization makes them computationally efficient, it restricts the model from fully exploiting the sequential nature of the input: the representation at a given layer can only access representations from lower layers, rather than the higher-level representations already built in previous time steps. In this work, we propose the Feedback Transformer architecture, which exposes all previous representations to all future representations, meaning the lowest representation of the current timestep is formed from the highest-level abstract representation of the past. We demonstrate on a variety of benchmarks in language modeling, neural machine translation, summarization, and reinforcement learning that the increased representation capacity can improve over Transformer baselines.

architecture, representation, transformer, (15 more...)

arXiv.org Machine Learning

2002.09402

Country: Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

The Attention Mechanism in Natural Language Processing - seq2seq

#artificialintelligence | Feb-20-2020, 14:04:17 GMT

The attention mechanism is now an established technique in many NLP tasks. I've heard about it often, but wanted to go a bit deeper and understand the details. In this first blog post (I plan to publish a few more on the subject of attention) I give an introduction, focusing on the first proposal of an attention mechanism, as applied to the task of neural machine translation. To the best of my knowledge, the attention mechanism within the context of NLP was first presented in "Neural Machine Translation by Jointly Learning to Align and Translate" at ICLR 2015 (Bahdanau et al. 2015). It was proposed in the context of machine translation, where, given a sentence in one language, the model has to produce a translation of that sentence in another language.

context vector, sequence, vector, (12 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
