AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Build a Translation Application with AWS

#artificialintelligenceAug-14-2021, 02:55:15 GMT

Amazon's suite of ML services is constantly expanding. From having capabilities of building custom ML pipelines in SageMaker to a versatile set of AutoML services, options to deploy and tackle ML problems are limitless. Neural Machine Translation is a theoretically intense field and requires deep knowledge of LSTMs and Deep Learning frameworks such as TensorFlow and PyTorch. For this article we will explore AWS Translate, a Neural Machine Translation tool that supports 71 languages and lets you build applications with a simple API call. This article is a continuation of the Auto-ML on AWS series, check out the Rekognition and Comprehend articles for the first two parts.

api call, aw translate, translation application, (7 more...)

#artificialintelligence

Industry: Information Technology > Security & Privacy (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Aspect Sentiment Triplet Extraction Using Reinforcement Learning

Jian, Samson Yu Bai, Nayak, Tapas, Majumder, Navonil, Poria, Soujanya

arXiv.org Artificial IntelligenceAug-13-2021

Aspect Sentiment Triplet Extraction (ASTE) is the task of extracting triplets of aspect terms, their associated sentiments, and the opinion terms that provide evidence for the expressed sentiments. Previous approaches to ASTE usually simultaneously extract all three components or first identify the aspect and opinion terms, then pair them up to predict their sentiment polarities. In this work, we present a novel paradigm, ASTE-RL, by regarding the aspect and opinion terms as arguments of the expressed sentiment in a hierarchical reinforcement learning (RL) framework. We first focus on sentiments expressed in a sentence, then identify the target aspect and opinion terms for that sentiment. This takes into account the mutual interactions among the triplet's components while improving exploration and sample efficiency. Furthermore, this hierarchical RLsetup enables us to deal with multiple and overlapping triplets. In our experiments, we evaluate our model on existing datasets from laptop and restaurant domains and show that it achieves state-of-the-art performance. The implementation of this work is publicly available at https://github.com/declare-lab/ASTE-RL.

aspect term, sentiment, triplet, (11 more...)

arXiv.org Artificial Intelligence

2108.06107

Country:

Asia > Singapore (0.05)
Oceania > Australia (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(3 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.62)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.48)
(2 more...)

Add feedback

The paradox of the compositionality of natural language: a neural machine translation case study

Dankers, Verna, Bruni, Elia, Hupkes, Dieuwke

arXiv.org Artificial IntelligenceAug-12-2021

Moving towards human-like linguistic performance is often argued to require compositional generalisation. Whether neural networks exhibit this ability is typically studied using artificial languages, for which the compositionality of input fragments can be guaranteed and their meanings algebraically composed. However, compositionality in natural language is vastly more complex than this rigid, arithmetics-like version of compositionality, and as such artificial compositionality tests do not allow us to draw conclusions about how neural models deal with compositionality in more realistic scenarios. In this work, we re-instantiate three compositionality tests from the literature and reformulate them for neural machine translation (NMT). The results highlight two main issues: the inconsistent behaviour of NMT models and their inability to (correctly) modulate between local and global processing. Aside from an empirical study, our work is a call to action: we should rethink the evaluation of compositionality in neural networks of natural language, where composing meaning is not as straightforward as doing the math.

compositionality, idiom, translation, (17 more...)

arXiv.org Artificial Intelligence

2108.05885

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Evaluation Metrics: Assessing the quality of NLG outputs

#artificialintelligenceAug-11-2021, 16:00:10 GMT

In the field of machine learning, as in the most unrelated fields as well, we need some sort of evaluation. You can think of a student taking an exam, a car in a crash test, a web server on load test, and performance evaluation of a model in AI. Evaluation methods differ among these fields and evolution criteria designed marginally. This procedure is needed mainly to assess the quality of outputs of a model, and also to compare them among different models or with different setups, etc. Natural Language Generation (NLG), a field in Natural Language Processing (NLP), is an applied subfield of artificial intelligence, where the goal is to produce a textual output. It has a vast amount of subtasks like machine translation (MT), question answering (QA), summarization, question generation (QG), etc. Here, the discussion is around the performance of the models whose outputs are text.

evaluation metric, nlg output

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.62)

Add feedback

An Overview of ML on AWS

#artificialintelligenceAug-11-2021, 07:25:27 GMT

When you start looking at ML outside of your local notebook or environment, you start getting into the world of Cloud Computing. Providers such as AWS, Azure, and GCP are offering an incredible suite of ML services in their respective Clouds that can help you take ML to a production grade scale. What's even more incredible is ML is slowly being democratized for all programmers. As ML has expanded a lot of the theory and knowledge behind the algorithms have been abstracted out into AutoML services that enable developers with no ML experience to launch applications powered by cutting edge AI. These Auto-AI services cover a variety of different ML fields such as NLP, Computer Vision, Time-Series Forecasting, and more.

algorithm, application, sagemaker, (16 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.48)

Add feedback

Improving Similar Language Translation With Transfer Learning

Adebara, Ife, Abdul-Mageed, Muhammad

arXiv.org Artificial IntelligenceAug-11-2021

We investigate transfer learning based on pre-trained neural machine translation models to translate between (low-resource) similar languages. This work is part of our contribution to the WMT 2021 Similar Languages Translation Shared Task where we submitted models for different language pairs, including French-Bambara, Spanish-Catalan, and Spanish-Portuguese in both directions. Our models for Catalan-Spanish ($82.79$ BLEU) and Portuguese-Spanish ($87.11$ BLEU) rank top 1 in the official shared task evaluation, and we are the only team to submit models for the French-Bambara pairs.

language pair, machine translation, translation, (13 more...)

arXiv.org Artificial Intelligence

2108.03533

Country:

Europe > Bulgaria (0.05)
North America > Canada > British Columbia (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
(3 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Kojima, Noriyuki, Suhr, Alane, Artzi, Yoav

arXiv.org Artificial IntelligenceAug-10-2021

We study continual learning for natural language instruction generation, by observing human users' instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of generated instructions to the original system intent as an indication to the system's success communicating its intent. We show how to use this signal to improve the system's ability to generate instructions via contextual bandit learning. In interaction with real users, our system demonstrates dramatic improvements in its ability to generate language over time.

instruction, interaction, latexit sha1, (15 more...)

arXiv.org Artificial Intelligence

2108.04812

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Games (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)
(2 more...)

Add feedback

Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation

Liu, Guangyi, Yang, Zichao, Tao, Tianhua, Liang, Xiaodan, Li, Zhen, Zhou, Bowen, Cui, Shuguang, Hu, Zhiting

arXiv.org Artificial IntelligenceAug-9-2021

Neural text generation models are typically trained by maximizing log-likelihood with the sequence cross entropy loss, which encourages an exact token-by-token match between a target sequence with a generated sequence. Such training objective is sub-optimal when the target sequence not perfect, e.g., when the target sequence is corrupted with noises, or when only weak sequence supervision is available. To address this challenge, we propose a novel Edit-Invariant Sequence Loss (EISL), which computes the matching loss of a target n-gram with all n-grams in the generated sequence. EISL draws inspirations from convolutional networks (ConvNets) which are shift-invariant to images, hence is robust to the shift of n-grams to tolerate edits in the target sequences. Moreover, the computation of EISL is essentially a convolution operation with target n-grams as kernels, which is easy to implement with existing libraries. To demonstrate the effectiveness of EISL, we conduct experiments on three tasks: machine translation with noisy target sequences, unsupervised text style transfer, and non-autoregressive machine translation. Experimental results show our method significantly outperforms cross entropy loss on these three tasks.

computational linguistic, eisl, sequence, (15 more...)

arXiv.org Artificial Intelligence

2106.15078

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Oceania > Australia > New South Wales > Sydney (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(23 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena

Vanmassenhove, Eva, Monti, Johanna

arXiv.org Artificial IntelligenceAug-5-2021

Languages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder-IT, an English--Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.

gender, referent, translation, (14 more...)

arXiv.org Artificial Intelligence

2108.02854

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)
(6 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

WeChat Neural Machine Translation Systems for WMT21

Zeng, Xianfeng, Liu, Yijin, Li, Ernan, Ran, Qiu, Meng, Fandong, Li, Peng, Xu, Jinan, Zhou, Jie

arXiv.org Artificial IntelligenceAug-5-2021

This paper introduces WeChat AI's participation in WMT 2021 shared news translation task on English->Chinese, English->Japanese, Japanese->English and English->German. Our systems are based on the Transformer (Vaswani et al., 2017) with several novel and effective variants. In our experiments, we employ data filtering, large-scale synthetic data generation (i.e., back-translation, knowledge distillation, forward-translation, iterative in-domain knowledge transfer), advanced finetuning approaches, and boosted Self-BLEU based model ensemble. Our constrained systems achieve 36.9, 46.9, 27.8 and 31.3 case-sensitive BLEU scores on English->Chinese, English->Japanese, Japanese->English and English->German, respectively. The BLEU scores of English->Chinese, English->Japanese and Japanese->English are the highest among all submissions, and that of English->German is the highest among all constrained submissions.

bleu score, computational linguistic, transformer, (15 more...)

arXiv.org Artificial Intelligence

2108.02401

Country:

Europe > Italy > Tuscany > Florence (0.05)
Oceania > Australia (0.04)
Europe > Germany > Berlin (0.04)
(7 more...)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Services (0.61)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback