Goto

Collaborating Authors

 Machine Translation


Build a Translation Application with AWS

#artificialintelligence

Amazon's suite of ML services is constantly expanding. From having capabilities of building custom ML pipelines in SageMaker to a versatile set of AutoML services, options to deploy and tackle ML problems are limitless. Neural Machine Translation is a theoretically intense field and requires deep knowledge of LSTMs and Deep Learning frameworks such as TensorFlow and PyTorch. For this article we will explore AWS Translate, a Neural Machine Translation tool that supports 71 languages and lets you build applications with a simple API call. This article is a continuation of the Auto-ML on AWS series, check out the Rekognition and Comprehend articles for the first two parts.


Aspect Sentiment Triplet Extraction Using Reinforcement Learning

arXiv.org Artificial Intelligence

Aspect Sentiment Triplet Extraction (ASTE) is the task of extracting triplets of aspect terms, their associated sentiments, and the opinion terms that provide evidence for the expressed sentiments. Previous approaches to ASTE usually simultaneously extract all three components or first identify the aspect and opinion terms, then pair them up to predict their sentiment polarities. In this work, we present a novel paradigm, ASTE-RL, by regarding the aspect and opinion terms as arguments of the expressed sentiment in a hierarchical reinforcement learning (RL) framework. We first focus on sentiments expressed in a sentence, then identify the target aspect and opinion terms for that sentiment. This takes into account the mutual interactions among the triplet's components while improving exploration and sample efficiency. Furthermore, this hierarchical RLsetup enables us to deal with multiple and overlapping triplets. In our experiments, we evaluate our model on existing datasets from laptop and restaurant domains and show that it achieves state-of-the-art performance. The implementation of this work is publicly available at https://github.com/declare-lab/ASTE-RL.


The paradox of the compositionality of natural language: a neural machine translation case study

arXiv.org Artificial Intelligence

Moving towards human-like linguistic performance is often argued to require compositional generalisation. Whether neural networks exhibit this ability is typically studied using artificial languages, for which the compositionality of input fragments can be guaranteed and their meanings algebraically composed. However, compositionality in natural language is vastly more complex than this rigid, arithmetics-like version of compositionality, and as such artificial compositionality tests do not allow us to draw conclusions about how neural models deal with compositionality in more realistic scenarios. In this work, we re-instantiate three compositionality tests from the literature and reformulate them for neural machine translation (NMT). The results highlight two main issues: the inconsistent behaviour of NMT models and their inability to (correctly) modulate between local and global processing. Aside from an empirical study, our work is a call to action: we should rethink the evaluation of compositionality in neural networks of natural language, where composing meaning is not as straightforward as doing the math.


Evaluation Metrics: Assessing the quality of NLG outputs

#artificialintelligence

In the field of machine learning, as in the most unrelated fields as well, we need some sort of evaluation. You can think of a student taking an exam, a car in a crash test, a web server on load test, and performance evaluation of a model in AI. Evaluation methods differ among these fields and evolution criteria designed marginally. This procedure is needed mainly to assess the quality of outputs of a model, and also to compare them among different models or with different setups, etc. Natural Language Generation (NLG), a field in Natural Language Processing (NLP), is an applied subfield of artificial intelligence, where the goal is to produce a textual output. It has a vast amount of subtasks like machine translation (MT), question answering (QA), summarization, question generation (QG), etc. Here, the discussion is around the performance of the models whose outputs are text.


An Overview of ML on AWS

#artificialintelligence

When you start looking at ML outside of your local notebook or environment, you start getting into the world of Cloud Computing. Providers such as AWS, Azure, and GCP are offering an incredible suite of ML services in their respective Clouds that can help you take ML to a production grade scale. What's even more incredible is ML is slowly being democratized for all programmers. As ML has expanded a lot of the theory and knowledge behind the algorithms have been abstracted out into AutoML services that enable developers with no ML experience to launch applications powered by cutting edge AI. These Auto-AI services cover a variety of different ML fields such as NLP, Computer Vision, Time-Series Forecasting, and more.


Improving Similar Language Translation With Transfer Learning

arXiv.org Artificial Intelligence

We investigate transfer learning based on pre-trained neural machine translation models to translate between (low-resource) similar languages. This work is part of our contribution to the WMT 2021 Similar Languages Translation Shared Task where we submitted models for different language pairs, including French-Bambara, Spanish-Catalan, and Spanish-Portuguese in both directions. Our models for Catalan-Spanish ($82.79$ BLEU) and Portuguese-Spanish ($87.11$ BLEU) rank top 1 in the official shared task evaluation, and we are the only team to submit models for the French-Bambara pairs.


Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

arXiv.org Artificial Intelligence

We study continual learning for natural language instruction generation, by observing human users' instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of generated instructions to the original system intent as an indication to the system's success communicating its intent. We show how to use this signal to improve the system's ability to generate instructions via contextual bandit learning. In interaction with real users, our system demonstrates dramatic improvements in its ability to generate language over time.


Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation

arXiv.org Artificial Intelligence

Neural text generation models are typically trained by maximizing log-likelihood with the sequence cross entropy loss, which encourages an exact token-by-token match between a target sequence with a generated sequence. Such training objective is sub-optimal when the target sequence not perfect, e.g., when the target sequence is corrupted with noises, or when only weak sequence supervision is available. To address this challenge, we propose a novel Edit-Invariant Sequence Loss (EISL), which computes the matching loss of a target n-gram with all n-grams in the generated sequence. EISL draws inspirations from convolutional networks (ConvNets) which are shift-invariant to images, hence is robust to the shift of n-grams to tolerate edits in the target sequences. Moreover, the computation of EISL is essentially a convolution operation with target n-grams as kernels, which is easy to implement with existing libraries. To demonstrate the effectiveness of EISL, we conduct experiments on three tasks: machine translation with noisy target sequences, unsupervised text style transfer, and non-autoregressive machine translation. Experimental results show our method significantly outperforms cross entropy loss on these three tasks.


GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena

arXiv.org Artificial Intelligence

Languages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder-IT, an English--Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.


WeChat Neural Machine Translation Systems for WMT21

arXiv.org Artificial Intelligence

This paper introduces WeChat AI's participation in WMT 2021 shared news translation task on English->Chinese, English->Japanese, Japanese->English and English->German. Our systems are based on the Transformer (Vaswani et al., 2017) with several novel and effective variants. In our experiments, we employ data filtering, large-scale synthetic data generation (i.e., back-translation, knowledge distillation, forward-translation, iterative in-domain knowledge transfer), advanced finetuning approaches, and boosted Self-BLEU based model ensemble. Our constrained systems achieve 36.9, 46.9, 27.8 and 31.3 case-sensitive BLEU scores on English->Chinese, English->Japanese, Japanese->English and English->German, respectively. The BLEU scores of English->Chinese, English->Japanese and Japanese->English are the highest among all submissions, and that of English->German is the highest among all constrained submissions.