AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

Scialom, Thomas, Dray, Paul-Alexis, Lamprier, Sylvain, Piwowarski, Benjamin, Staiano, Jacopo

arXiv.org Artificial IntelligenceJun-11-2021

Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and generator networks cooperate to produce output sequences during training. These cooperative outputs, inherently built to obtain higher discrimination scores, not only provide denser rewards for training, but also form a more compact artificial set for discriminator training, hence improving its accuracy and stability. In this paper, we show that our SelfGAN framework, built on this cooperative principle, outperforms Teacher Forcing and obtains state-of-the-art results on two challenging tasks, Summarization and Question Generation.

arxiv preprint arxiv, discriminator, sequence, (13 more...)

arXiv.org Artificial Intelligence

2106.06363

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > Colorado (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Sports > Football (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation

Mukherjee, Subhabrata, Awadallah, Ahmed Hassan, Gao, Jianfeng

arXiv.org Artificial IntelligenceJun-11-2021

While deep and large pre-trained models are the state-of-the-art for various natural language processing tasks, their huge size poses significant challenges for practical uses in resource constrained settings. Recent works in knowledge distillation propose task-agnostic as well as task-specific methods to compress these models, with task-specific ones often yielding higher compression rate. In this work, we develop a new task-agnostic distillation framework XtremeDistilTransformers that leverages the advantage of task-specific methods for learning a small universal model that can be applied to arbitrary tasks and languages. To this end, we study the transferability of several source tasks, augmentation resources and model architecture for distillation. We evaluate our model performance on multiple tasks, including the General Language Understanding Evaluation (GLUE) benchmark, SQuAD question answering dataset and a massive multi-lingual NER dataset with 41 languages. We release three distilled task-agnostic checkpoints with 13MM, 22MM and 33MM parameters obtaining SOTA performance in several tasks.

computational linguistic, distillation, xtremedistiltransformer, (15 more...)

arXiv.org Artificial Intelligence

2106.04563

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
Europe > Italy > Tuscany > Florence (0.04)
Oceania > Australia (0.04)
(4 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Vivaldi adds mail, calendar, RSS and translation tools to its privacy-focused browser

EngadgetJun-9-2021, 11:03:21 GMT

Vivaldi has released a major update for its eponymous web browser for privacy-minded power users. Version 4.0 bring with it a translation tool, along with beta versions of Vivaldi Mail, Calendar, and Feed Reader. The update is available now on Windows, Mac and Linux and Android devices. Vivaldi built its translation feature into its browser. The tool is powered by Lingvanex, a Cyprus-based company that makes translator's for a wider range of platforms including voice calls and smartwatches. As part of its focus on privacy, Vivaldi says that all your translation activity will be kept away from third-parties on its servers in Iceland.

browser, translation tool, vivaldi, (7 more...)

Engadget

Country:

Europe > Middle East > Cyprus (0.27)
Europe > Iceland (0.27)

Industry:

Information Technology (0.76)
Telecommunications (0.59)

Technology:

Information Technology > Communications > Web (0.89)
Information Technology > Communications > Mobile (0.76)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.63)

Add feedback

Bayesian Attention Belief Networks

Zhang, Shujian, Fan, Xinjie, Chen, Bo, Zhou, Mingyuan

arXiv.org Machine LearningJun-9-2021

Attention-based neural networks have achieved state-of-the-art results on a wide range of tasks. Most such models use deterministic attention while stochastic attention is less explored due to the optimization difficulties or complicated model design. This paper introduces Bayesian attention belief networks, which construct a decoder network by modeling unnormalized attention weights with a hierarchy of gamma distributions, and an encoder network by stacking Weibull distributions with a deterministic-upward-stochastic-downward structure to approximate the posterior. The resulting auto-encoding networks can be optimized in a differentiable way with a variational lower bound. It is simple to convert any models with deterministic attention, including pretrained ones, to the proposed Bayesian attention belief networks. On a variety of language understanding tasks, we show that our method outperforms deterministic attention and state-of-the-art stochastic attention in accuracy, uncertainty estimation, generalization across domains, and robustness to adversarial attacks. We further demonstrate the general applicability of our method on neural machine translation and visual question answering, showing great potential of incorporating our method into various attention-related tasks.

arxiv preprint arxiv, attention weight, proceedings, (14 more...)

arXiv.org Machine Learning

2106.05251

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.34)
Government > Military (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Du, Cunxiao, Tu, Zhaopeng, Jiang, Jing

arXiv.org Artificial IntelligenceJun-9-2021

We propose a new training objective named order-agnostic cross entropy (OaXE) for fully non-autoregressive translation (NAT) models. OaXE improves the standard cross-entropy loss to ameliorate the effect of word reordering, which is a common source of the critical multimodality problem in NAT. Concretely, OaXE removes the penalty for word order errors, and computes the cross entropy loss based on the best possible alignment between model predictions and target tokens. Since the log loss is very sensitive to invalid references, we leverage cross entropy initialization and loss truncation to ensure the model focuses on a good part of the search space. Extensive experiments on major WMT benchmarks show that OaXE substantially improves translation performance, setting new state of the art for fully NAT models. Further analyses show that OaXE alleviates the multimodality problem by reducing token repetitions and increasing prediction confidence. Our code, data, and trained models are available at https://github.com/tencent-ailab/ICML21_OAXE.

nat model, prediction, translation, (11 more...)

arXiv.org Artificial Intelligence

2106.05093

Country:

Asia > Singapore (0.05)
Asia > China (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Example Of Machine Translation In Python And Tensorflow

#artificialintelligenceJun-8-2021, 19:55:10 GMT

We will build a deep neural network that functions as part of an end-to-end machine translation pipeline. The completed pipeline will accept English text as input and return the French translation. For our model, we will use an English and French sample of sentences. The data is located in data/small_vocab_en and data/small_vocab_fr. The small_vocab_en file contains English sentences with their French translations in the small_vocab_fr file.

neural network, sequence, translation, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Add feedback

Encouraging Neural Machine Translation to Satisfy Terminology Constraints

Ailem, Melissa, Liu, Jinghsu, Qader, Raheel

arXiv.org Artificial IntelligenceJun-7-2021

We present a new approach to encourage neural machine translation to satisfy lexical constraints. Our method acts at the training step and thereby avoiding the introduction of any extra computational overhead at inference step. The proposed method combines three main ingredients. The first one consists in augmenting the training data to specify the constraints. Intuitively, this encourages the model to learn a copy behavior when it encounters constraint terms. Compared to previous work, we use a simplified augmentation strategy without source factors. The second ingredient is constraint token masking, which makes it even easier for the model to learn the copy behavior and generalize better. The third one, is a modification of the standard cross entropy loss to bias the model towards assigning high probabilities to constraint words. Empirical results show that our method improves upon related baselines in terms of both BLEU score and the percentage of generated constraint terms.

constraint, machine translation, translation, (13 more...)

arXiv.org Artificial Intelligence

2106.0373

Country: Europe > France (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Goyal, Naman, Gao, Cynthia, Chaudhary, Vishrav, Chen, Peng-Jen, Wenzek, Guillaume, Ju, Da, Krishnan, Sanjana, Ranzato, Marc'Aurelio, Guzman, Francisco, Fan, Angela

arXiv.org Artificial IntelligenceJun-6-2021

One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restricted domains, or are low quality because they are constructed using semi-automatic procedures. In this work, we introduce the FLORES-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. These sentences have been translated in 101 languages by professional translators through a carefully controlled process. The resulting dataset enables better assessment of model quality on the long tail of low-resource languages, including the evaluation of many-to-many multilingual translation systems, as all translations are multilingually aligned. By publicly releasing such a high-quality and high-coverage dataset, we hope to foster progress in the machine translation community and beyond.

evaluation, machine translation, translation, (12 more...)

arXiv.org Artificial Intelligence

2106.03193

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
Europe > Italy > Tuscany > Florence (0.04)
North America > Central America (0.04)
(7 more...)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Xu, Haiyang, Yan, Ming, Li, Chenliang, Bi, Bin, Huang, Songfang, Xiao, Wenming, Huang, Fei

arXiv.org Artificial IntelligenceJun-4-2021

Vision-language pre-training (VLP) on large-scale image-text pairs has achieved huge success for the cross-modal downstream tasks. The most existing pre-training methods mainly adopt a two-step training procedure, which firstly employs a pre-trained object detector to extract region-based visual features, then concatenates the image representation and text embedding as the input of Transformer to train. However, these methods face problems of using task-specific visual representation of the specific object detector for generic cross-modal understanding, and the computation inefficiency of two-stage pipeline. In this paper, we propose the first end-to-end vision-language pre-trained model for both V+L understanding and generation, namely E2E-VLP, where we build a unified Transformer framework to jointly learn visual representation, and semantic alignments between image and text. We incorporate the tasks of object detection and image captioning into pre-training with a unified Transformer encoder-decoder architecture for enhancing visual learning. An extensive set of experiments have been conducted on well-established vision-language downstream tasks to demonstrate the effectiveness of this novel VLP paradigm.

architecture, e2e-vlp, representation, (15 more...)

arXiv.org Artificial Intelligence

2106.01804

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

Part of Speech and Universal Dependency effects on English Arabic Machine Translation

Rafaeli, Ofek, Abend, Omri, Choshen, Leshem, Nikolaev, Dmitry

arXiv.org Artificial IntelligenceJun-3-2021

In this research paper, I will elaborate on a method to evaluate machine translation models based on their performance on underlying syntactical phenomena between English and Arabic languages. This method is especially important as such "neural" and "machine learning" are hard to fine-tune and change. Thus, finding a way to evaluate them easily and diversely would greatly help the task of bettering them.

artificial intelligence, natural language, translation, (14 more...)

arXiv.org Artificial Intelligence

2106.00745

Country:

Europe > United Kingdom > England (0.05)
Asia > Mongolia (0.04)
Asia > India (0.04)
Africa > South Africa > Western Cape > Cape Town (0.04)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback