AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Sequence to sequence pretraining for a less-resourced Slovenian language

Ulčar, Matej, Robnik-Šikonja, Marko

arXiv.org Artificial IntelligenceJan-2-2023

Large pretrained language models have recently conquered the area of natural language processing. As an alternative to predominant masked language modelling introduced in BERT, the T5 model has introduced a more general training objective, namely sequence to sequence transformation, which includes masked language model but more naturally fits text generation tasks such as machine translation, summarization, question answering, text simplification, dialogue systems, etc. The monolingual variants of T5 models have been limited to well-resourced languages, while the massively multilingual T5 model supports 101 languages. In contrast, we trained two different sized T5-type sequence to sequence models for morphologically rich Slovene language with much less resources and analyzed their behavior on 11 tasks. Concerning classification tasks, the SloT5 models mostly lag behind the monolingual Slovene SloBERTa model but are useful for the generative tasks.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2207.13988

Country:

Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.05)
Europe > Germany (0.04)
Europe > Slovenia > Upper Carniola > Municipality of Kranj > Kranj (0.04)
(6 more...)

Genre: Research Report (0.82)

Industry: Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
(2 more...)

Add feedback

Text Style Transfer: A Review and Experimental Evaluation

Hu, Zhiqiang, Lee, Roy Ka-Wei, Aggarwal, Charu C., Zhang, Aston

arXiv.org Artificial IntelligenceJan-1-2023

The stylistic properties of text have intrigued computational linguistics researchers in recent years. Specifically, researchers have investigated the Text Style Transfer (TST) task, which aims to change the stylistic properties of the text while retaining its style independent content. Over the last few years, many novel TST algorithms have been developed, while the industry has leveraged these algorithms to enable exciting TST applications. The field of TST research has burgeoned because of this symbiosis. This article aims to provide a comprehensive review of recent research efforts on text style transfer. More concretely, we create a taxonomy to organize the TST models and provide a comprehensive summary of the state of the art. We review the existing evaluation methodologies for TST tasks and conduct a large-scale reproducibility study where we experimentally benchmark 19 state-of-the-art TST algorithms on two publicly available datasets. Finally, we expand on current trends and provide new perspectives on the new and exciting developments in the TST field.

machine learning, natural language, reinforcement learning, (22 more...)

arXiv.org Artificial Intelligence

2010.12742

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > Australia (0.04)
(16 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.67)
Research Report > New Finding (0.45)

Industry:

Education (1.00)
Information Technology (0.92)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(4 more...)

Add feedback

Active Learning for Neural Machine Translation

Vashistha, Neeraj, Singh, Kriti, Shakya, Ramakant

arXiv.org Artificial IntelligenceDec-30-2022

The machine translation mechanism translates texts automatically between different natural languages, and Neural Machine Translation (NMT) has gained attention for its rational context analysis and fluent translation accuracy. However, processing low-resource languages that lack relevant training attributes like supervised data is a current challenge for Natural Language Processing (NLP). We incorporated a technique known Active Learning with the NMT toolkit Joey NMT to reach sufficient accuracy and robust predictions of low-resource language translation. With active learning, a semi-supervised machine learning strategy, the training algorithm determines which unlabeled data would be the most beneficial for obtaining labels using selected query techniques. We implemented two model-driven acquisition functions for selecting the samples to be validated. This work uses transformer-based NMT systems; baseline model (BM), fully trained model (FTM) , active learning least confidence based model (ALLCM), and active learning margin sampling based model (ALMSM) when translating English to Hindi. The Bilingual Evaluation Understudy (BLEU) metric has been used to evaluate system results. The BLEU scores of BM, FTM, ALLCM and ALMSM systems are 16.26, 22.56 , 24.54, and 24.20, respectively. The findings in this paper demonstrate that active learning techniques helps the model to converge early and improve the overall quality of the translation system.

machine learning, natural language, neural machine translation, (2 more...)

arXiv.org Artificial Intelligence

2301.00688

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation

Santy, Sebastin, Bhattacharya, Prasanta

arXiv.org Artificial IntelligenceDec-30-2022

Recent advances in AI and ML applications have benefited from rapid progress in NLP research. Leaderboards have emerged as a popular mechanism to track and accelerate progress in NLP through competitive model development. While this has increased interest and participation, the over-reliance on single, and accuracy-based metrics have shifted focus from other important metrics that might be equally pertinent to consider in real-world contexts. In this paper, we offer a preliminary discussion of the risks associated with focusing exclusively on accuracy metrics and draw on recent discussions to highlight prescriptive suggestions on how to develop more practical and effective leaderboards that can better reflect the real-world utility of models.

artificial intelligence, machine learning, natural language, (13 more...)

arXiv.org Artificial Intelligence

2106.06292

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
Asia > Indonesia > Bali (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
(10 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Understanding Attention in Natural Language Processing with 3 Projects

#artificialintelligenceDec-27-2022, 00:30:07 GMT

In this blog post, I'll summarize my understanding of attention used in natural language processing (NLP). As a machine learning and NLP self-learner, when I initially got exposed to the idea of attention, I felt overwhelmed by its whole bunch of different variations and all the nitty-gritties involved in the implementations. Now, after reading articles, blogs and code, watching YouTube videos and also implementing it myself in several projects, I found it actually not that hard to understand when looking back. Hopefully by sharing what I learned along the journey, I could help some of those who are also going though that learning process, especially beginners like who I was a couple of months ago, speed up their progress and make it a bit more enjoyable. The concept of attention was firstly widely spread because of its use in the sequence-to-sequence (seq2seq) model for neural machine translation.

encoder, information, natural language processing, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.96)

Add feedback

Automatic Text Simplification of News Articles in the Context of Public Broadcasting

Maupomé, Diego, Rancourt, Fanny, Soulas, Thomas, Lachance, Alexandre, Meurs, Marie-Jean, Aleksandrova, Desislava, Dufour, Olivier Brochu, Pontes, Igor, Cardon, Rémi, Simard, Michel, Vajjala, Sowmya

arXiv.org Artificial IntelligenceDec-26-2022

This report summarizes the work carried out by the authors during the Twelfth Montreal Industrial Problem Solving Workshop, held at Université de Montréal in August 2022. The team tackled a problem submitted by CBC/Radio-Canada on the theme of Automatic Text Simplification (ATS). In order to make its written content more widely accessible, and to support its second-language teaching activities, CBC/RC has recently been exploring the potential of automatic methods to simplify texts. They have developed a modular lexical simplification system (LSS), which identifies complex words in French and English texts, and replaces them with simpler, more common equivalents. Recently however, the ATS research community has proposed a number of approaches that rely on deep learning methods to perform more elaborate transformations, not limited to just lexical substitutions, but covering syntactic restructuring and conceptual simplifications as well.

machine learning, natural language, simplification, (18 more...)

arXiv.org Artificial Intelligence

2212.13317

Country:

North America > Canada > Quebec > Montreal (0.54)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Finland > Uusimaa > Helsinki (0.04)
(8 more...)

Genre:

Research Report (0.50)
Instructional Material (0.48)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

Differentiable N-gram Objective on Abstractive Summarization

Zhu, Yunqi, Yang, Xuebing, Wu, Yuanyuan, Zhu, Mingjin, Zhang, Wensheng

arXiv.org Artificial IntelligenceDec-25-2022

ROUGE is a standard automatic evaluation metric based on n-grams for sequence-to-sequence tasks, while cross-entropy loss is an essential objective of neural network language model that optimizes at a unigram level. We present differentiable n-gram objectives, attempting to alleviate the discrepancy between training criterion and evaluating criterion. The objective maximizes the probabilistic weight of matched sub-sequences, and the novelty of our work is the objective weights the matched sub-sequences equally and does not ceil the number of matched sub-sequences by the ground truth count of n-grams in reference sequence. We jointly optimize cross-entropy loss and the proposed objective, providing decent ROUGE score enhancement over abstractive summarization dataset CNN/DM and XSum, outperforming alternative n-gram objectives.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2202.04003

Country:

North America > United States > California > San Bernardino County (0.04)
North America > United States > California > Los Angeles County (0.04)
Europe > United Kingdom > England (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry:

Government > Regional Government (0.94)
Law Enforcement & Public Safety (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation

Hao, Wenjie, Xu, Hongfei, Mu, Lingling, Zan, Hongying

arXiv.org Artificial IntelligenceDec-24-2022

In this paper, we study the use of deep Transformer translation model for the CCMT 2022 Chinese Thai low-resource machine translation task. We first explore the experiment settings (including the number of BPE merge operations, dropout probability, embedding size, etc.) for the low-resource scenario with the 6-layer Transformer. Considering that increasing the number of layers also increases the regularization on new model parameters (dropout modules are also introduced when using more layers), we adopt the highest performance setting but increase the depth of the Transformer to 24 layers to obtain improved translation quality. Our work obtains the SOTA performance in the Chinese-to-Thai translation in the constrained evaluation.

artificial intelligence, computational linguistic, natural language, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-981-19-7960-6_12

2212.12662

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
Asia > China > Hong Kong (0.05)
(8 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

SYMBA: Symbolic Computation of Squared Amplitudes in High Energy Physics with Machine Learning

Alnuqaydan, Abdulhakim, Gleyzer, Sergei, Prosper, Harrison

arXiv.org Artificial IntelligenceDec-23-2022

The cross section is one of the most important physical quantities in high-energy physics and the most time consuming to compute. While machine learning has proven to be highly successful in numerical calculations in high-energy physics, analytical calculations using machine learning are still in their infancy. In this work, we use a sequence-to-sequence model, specifically, a transformer, to compute a key element of the cross section calculation, namely, the squared amplitude of an interaction. We show that a transformer model is able to predict correctly 97.6% and 99% of squared amplitudes of QCD and QED processes, respectively, at a speed that is up to orders of magnitude faster than current symbolic computation frameworks. We discuss the performance of the current model, its limitations and possible future directions for this work.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1088/2632-2153/acb2b2

2206.08901

Country: North America > United States (1.00)

Genre: Research Report (0.87)

Industry: Government > Regional Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.62)

Add feedback

Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing

Brannon, William, Virkar, Yogesh, Thompson, Brian

arXiv.org Artificial IntelligenceDec-22-2022

We investigate how humans perform the task of dubbing video content from one language into another, leveraging a novel corpus of 319.57 hours of video from 54 professionally produced titles. This is the first such large-scale study we are aware of. The results challenge a number of assumptions commonly made in both qualitative literature on human dubbing and machine-learning literature on automatic dubbing, arguing for the importance of vocal naturalness and translation quality over commonly emphasized isometric (character length) and lip-sync constraints, and for a more qualified view of the importance of isochronic (timing) constraints. We also find substantial influence of the source-side audio on human dubs through channels other than the words of the translation, pointing to the need for research on ways to preserve speech characteristics, as well as semantic transfer such as emphasis/emotion, in automatic dubbing systems.

constraint, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.12137

Country:

Europe > Netherlands > North Holland > Amsterdam (0.05)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Ontario > Toronto (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment (0.94)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback