AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Understanding Attention in Natural Language Processing with 3 Projects

#artificialintelligenceDec-27-2022, 00:30:07 GMT

In this blog post, I'll summarize my understanding of attention used in natural language processing (NLP). As a machine learning and NLP self-learner, when I initially got exposed to the idea of attention, I felt overwhelmed by its whole bunch of different variations and all the nitty-gritties involved in the implementations. Now, after reading articles, blogs and code, watching YouTube videos and also implementing it myself in several projects, I found it actually not that hard to understand when looking back. Hopefully by sharing what I learned along the journey, I could help some of those who are also going though that learning process, especially beginners like who I was a couple of months ago, speed up their progress and make it a bit more enjoyable. The concept of attention was firstly widely spread because of its use in the sequence-to-sequence (seq2seq) model for neural machine translation.

encoder, information, natural language processing, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.96)

Add feedback

Automatic Text Simplification of News Articles in the Context of Public Broadcasting

Maupomé, Diego, Rancourt, Fanny, Soulas, Thomas, Lachance, Alexandre, Meurs, Marie-Jean, Aleksandrova, Desislava, Dufour, Olivier Brochu, Pontes, Igor, Cardon, Rémi, Simard, Michel, Vajjala, Sowmya

arXiv.org Artificial IntelligenceDec-26-2022

This report summarizes the work carried out by the authors during the Twelfth Montreal Industrial Problem Solving Workshop, held at Université de Montréal in August 2022. The team tackled a problem submitted by CBC/Radio-Canada on the theme of Automatic Text Simplification (ATS). In order to make its written content more widely accessible, and to support its second-language teaching activities, CBC/RC has recently been exploring the potential of automatic methods to simplify texts. They have developed a modular lexical simplification system (LSS), which identifies complex words in French and English texts, and replaces them with simpler, more common equivalents. Recently however, the ATS research community has proposed a number of approaches that rely on deep learning methods to perform more elaborate transformations, not limited to just lexical substitutions, but covering syntactic restructuring and conceptual simplifications as well.

machine learning, natural language, simplification, (18 more...)

arXiv.org Artificial Intelligence

2212.13317

Country:

North America > Canada > Quebec > Montreal (0.54)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Finland > Uusimaa > Helsinki (0.04)
(8 more...)

Genre:

Research Report (0.50)
Instructional Material (0.48)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

Differentiable N-gram Objective on Abstractive Summarization

Zhu, Yunqi, Yang, Xuebing, Wu, Yuanyuan, Zhu, Mingjin, Zhang, Wensheng

arXiv.org Artificial IntelligenceDec-25-2022

ROUGE is a standard automatic evaluation metric based on n-grams for sequence-to-sequence tasks, while cross-entropy loss is an essential objective of neural network language model that optimizes at a unigram level. We present differentiable n-gram objectives, attempting to alleviate the discrepancy between training criterion and evaluating criterion. The objective maximizes the probabilistic weight of matched sub-sequences, and the novelty of our work is the objective weights the matched sub-sequences equally and does not ceil the number of matched sub-sequences by the ground truth count of n-grams in reference sequence. We jointly optimize cross-entropy loss and the proposed objective, providing decent ROUGE score enhancement over abstractive summarization dataset CNN/DM and XSum, outperforming alternative n-gram objectives.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2202.04003

Country:

North America > United States > California > San Bernardino County (0.04)
North America > United States > California > Los Angeles County (0.04)
Europe > United Kingdom > England (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry:

Government > Regional Government (0.94)
Law Enforcement & Public Safety (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation

Hao, Wenjie, Xu, Hongfei, Mu, Lingling, Zan, Hongying

arXiv.org Artificial IntelligenceDec-24-2022

In this paper, we study the use of deep Transformer translation model for the CCMT 2022 Chinese Thai low-resource machine translation task. We first explore the experiment settings (including the number of BPE merge operations, dropout probability, embedding size, etc.) for the low-resource scenario with the 6-layer Transformer. Considering that increasing the number of layers also increases the regularization on new model parameters (dropout modules are also introduced when using more layers), we adopt the highest performance setting but increase the depth of the Transformer to 24 layers to obtain improved translation quality. Our work obtains the SOTA performance in the Chinese-to-Thai translation in the constrained evaluation.

artificial intelligence, computational linguistic, natural language, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-981-19-7960-6_12

2212.12662

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
Asia > China > Hong Kong (0.05)
(8 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

SYMBA: Symbolic Computation of Squared Amplitudes in High Energy Physics with Machine Learning

Alnuqaydan, Abdulhakim, Gleyzer, Sergei, Prosper, Harrison

arXiv.org Artificial IntelligenceDec-23-2022

The cross section is one of the most important physical quantities in high-energy physics and the most time consuming to compute. While machine learning has proven to be highly successful in numerical calculations in high-energy physics, analytical calculations using machine learning are still in their infancy. In this work, we use a sequence-to-sequence model, specifically, a transformer, to compute a key element of the cross section calculation, namely, the squared amplitude of an interaction. We show that a transformer model is able to predict correctly 97.6% and 99% of squared amplitudes of QCD and QED processes, respectively, at a speed that is up to orders of magnitude faster than current symbolic computation frameworks. We discuss the performance of the current model, its limitations and possible future directions for this work.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1088/2632-2153/acb2b2

2206.08901

Country: North America > United States (1.00)

Genre: Research Report (0.87)

Industry: Government > Regional Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.62)

Add feedback

Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing

Brannon, William, Virkar, Yogesh, Thompson, Brian

arXiv.org Artificial IntelligenceDec-22-2022

We investigate how humans perform the task of dubbing video content from one language into another, leveraging a novel corpus of 319.57 hours of video from 54 professionally produced titles. This is the first such large-scale study we are aware of. The results challenge a number of assumptions commonly made in both qualitative literature on human dubbing and machine-learning literature on automatic dubbing, arguing for the importance of vocal naturalness and translation quality over commonly emphasized isometric (character length) and lip-sync constraints, and for a more qualified view of the importance of isochronic (timing) constraints. We also find substantial influence of the source-side audio on human dubs through channels other than the words of the translation, pointing to the need for research on ways to preserve speech characteristics, as well as semantic transfer such as emphasis/emotion, in automatic dubbing systems.

constraint, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.12137

Country:

Europe > Netherlands > North Holland > Amsterdam (0.05)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Ontario > Toronto (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment (0.94)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Improving Automated Program Repair with Domain Adaptation

Zirak, Armin, Hemmati, Hadi

arXiv.org Artificial IntelligenceDec-21-2022

Automated Program Repair (APR) is defined as the process of fixing a bug/defect in the source code, by an automated tool. APR tools have recently experienced promising results by leveraging state-of-the-art Neural Language Processing (NLP) techniques. APR tools such as TFix and CodeXGLUE combine text-to-text transformers with software-specific techniques are outperforming alternatives, these days. However, in most APR studies the train and test sets are chosen from the same set of projects. In reality, however, APR models are meant to be generalizable to new and different projects. Therefore, there is a potential threat that reported APR models with high effectiveness perform poorly when the characteristics of the new project or its bugs are different than the training set's(Domain Shift). In this study, we first define and measure the domain shift problem in automated program repair. Then, we then propose a domain adaptation framework that can adapt an APR model for a given target project. We conduct an empirical study with three domain adaptation methods FullFineTuning, TuningWithLightWeightAdapterLayers, and CurriculumLearning using two state-of-the-art domain adaptation tools (TFix and CodeXGLUE) and two APR models on 611 bugs from 19 projects. The results show that our proposed framework can improve the effectiveness of TFix by 13.05% and CodeXGLUE by 23.4%. Another contribution of this study is the proposal of a data synthesis method to address the lack of labelled data in APR. We leverage transformers to create a bug generator model. We use the generated synthetic data to domain adapt TFix and CodeXGLUE on the projects with no data (Zero-shot learning), which results in an average improvement of 5.76% and 24.42% for TFix and CodeXGLUE, respectively.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2212.11414

Country: North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.96)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)

Add feedback

AIhub monthly digest: December 2022 – AI around the world, teleoperation, and multilingual translation

AIHubDec-20-2022, 10:17:49 GMT

Welcome to our December 2022 monthly digest, where you can catch up with any AIhub stories you may have missed, get the low-down on recent events, and much more. This month, we hear from best paper award winners at ICIP and NeurIPS, and find out more about teleoperation, multilingual translation, and quality-diversity algorithms. We also have exciting news, in the form of a new focus series. We're delighted to announce the launch of our new focus series on AI around the world, where we cover exciting applications of AI across the globe. To kick off the series, we spoke with Rose Nakasi.

algorithm, monthly digest, multilingual translation, (14 more...)

AIHub

Country: Oceania > Australia (0.05)

Genre: Personal > Honors > Award (0.36)

Industry: Health & Medicine (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.35)

Add feedback

A Mutation-based Text Generation for Adversarial Machine Learning Applications

Guerrero, Jesus, Liang, Gongbo, Alsmadi, Izzat

arXiv.org Artificial IntelligenceDec-20-2022

Currently, text generation is widely used in Machine Learning (ML)-based or Artificial Intelligence (AI)-based natural language applications such as language to language translation, document summary, headline or abstract generation. Those applications can be classified into different categories. In one classification, they can be divided into short versus long text generation applications. Short text generation applications include examples such as predicting next word or statement, image caption generation, short language translation, and documents summarization. Long text generation applications include long text story completion, review generation, language translation, poetry generation, and question answering.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2212.11808

Country: North America > United States > Texas > Bexar County > San Antonio (0.14)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters

Yang, Eugene, Nair, Suraj, Lawrie, Dawn, Mayfield, James, Oard, Douglas W.

arXiv.org Artificial IntelligenceDec-20-2022

A popular approach to creating a zero-shot cross-language retrieval model is to substitute a monolingual pretrained language model in the retrieval model with a multilingual pretrained language model such as Multilingual BERT. This multilingual model is fined-tuned to the retrieval task with monolingual data such as English MS MARCO using the same training recipe as the monolingual retrieval model used. However, such transferred models suffer from mismatches in the languages of the input text during training and inference. In this work, we propose transferring monolingual retrieval models using adapters, a parameter-efficient component for a transformer network. By adding adapters pretrained on language tasks for a specific language with task-specific adapters, prior work has shown that the adapter-enhanced models perform better than fine-tuning the entire model when transferring across languages in various NLP tasks. By constructing dense retrieval models with adapters, we show that models trained with monolingual data are more effective than fine-tuning the entire model when transferring to a Cross Language Information Retrieval (CLIR) setting. However, we found that the prior suggestion of replacing the language adapters to match the target language at inference time is suboptimal for dense retrieval models. We provide an in-depth analysis of this discrepancy between other cross-language NLP tasks and CLIR.

information retrieval, large language model, natural language, (15 more...)

arXiv.org Artificial Intelligence

2212.10448

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland > Baltimore (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)

Add feedback