AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Enhancing Speech-to-Speech Translation with Multiple TTS Targets

Shi, Jiatong, Tang, Yun, Lee, Ann, Inaguma, Hirofumi, Wang, Changhan, Pino, Juan, Watanabe, Shinji

arXiv.org Artificial IntelligenceApr-10-2023

It has been known that direct speech-to-speech translation (S2ST) models usually suffer from the data scarcity issue because of the limited existing parallel materials for both source and target speech. Therefore to train a direct S2ST system, previous works usually utilize text-to-speech (TTS) systems to generate samples in the target language by augmenting the data from speech-to-text translation (S2TT). However, there is a limited investigation into how the synthesized target speech would affect the S2ST models. In this work, we analyze the effect of changing synthesized target speech for direct S2ST models. We find that simply combining the target speech from different TTS systems can potentially improve the S2ST performances. Following that, we also propose a multi-task framework that jointly optimizes the S2ST system with multiple targets from different TTS systems. Extensive experiments demonstrate that our proposed framework achieves consistent improvements (2.8 BLEU) over the baselines on the Fisher Spanish-English dataset.

artificial intelligence, natural language, tts system, (18 more...)

arXiv.org Artificial Intelligence

2304.04618

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach

Ghanem, Bilal, Fyshe, Alona

arXiv.org Artificial IntelligenceApr-10-2023

Multiple choice questions (MCQs) are an efficient and common way to assess reading comprehension (RC). Every MCQ needs a set of distractor answers that are incorrect, but plausible enough to test student knowledge. Distractor generation (DG) models have been proposed, and their performance is typically evaluated using machine translation (MT) metrics. However, MT metrics often misjudge the suitability of generated distractors. We propose DISTO: the first learned evaluation metric for generated distractors. We validate DISTO by showing its scores correlate highly with human ratings of distractor quality. At the same time, DISTO ranks the performance of stateof-the-art Figure 1: A multi-choice question example from the DG models very differently from RACE dataset (Lai et al., 2017). The generated distractors MT-based metrics, showing that MT metrics were produced using a T5 model. Though the should not be used for distractor evaluation.

artificial intelligence, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2304.04881

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Alberta (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry: Education > Assessment & Standards > Student Performance (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.89)

Add feedback

Bridging Graph Position Encodings for Transformers with Weighted Graph-Walking Automata

Soga, Patrick, Chiang, David

arXiv.org Artificial IntelligenceApr-9-2023

A current goal in the graph neural network literature is to enable transformers to operate on graph-structured data, given their success on language and vision tasks. Since the transformer's original sinusoidal position encodings (PEs) are not applicable to graphs, recent work has focused on developing graph PEs, rooted in spectral graph theory or various spatial features of a graph. In this work, we introduce a new graph PE, Graph Automaton PE (GAPE), based on weighted graph-walking automata (a novel extension of graph-walking automata). We compare the performance of GAPE with other PE schemes on both machine translation and graph-structured tasks, and we show that it generalizes and connects with several other PEs. An additional contribution of this study is a theoretical and controlled experimental comparison of many recent PEs in graph transformers, independent of the use of edge features.

data mining, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2212.06898

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report > Experimental Study (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(2 more...)

Add feedback

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

Fu, Zihao, Lam, Wai, Yu, Qian, So, Anthony Man-Cho, Hu, Shengding, Liu, Zhiyuan, Collier, Nigel

arXiv.org Artificial IntelligenceApr-8-2023

The sequence-to-sequence (seq2seq) task aims at generating the target sequence based on the given input source sequence. Traditionally, most of the seq2seq task is resolved by the Encoder-Decoder framework which requires an encoder to encode the source sequence and a decoder to generate the target text. Recently, a bunch of new approaches have emerged that apply decoder-only language models directly to the seq2seq task. Despite the significant advancements in applying language models to the seq2seq task, there is still a lack of thorough analysis on the effectiveness of the decoder-only language model architecture. This paper aims to address this gap by conducting a detailed comparison between the encoder-decoder architecture and the decoder-only language model framework through the analysis of a regularized encoder-decoder structure. This structure is designed to replicate all behaviors in the classical decoder-only language model but has an encoder and a decoder making it easier to be compared with the classical encoder-decoder structure. Based on the analysis, we unveil the attention degeneration problem in the language model, namely, as the generation step number grows, less and less attention is focused on the source sequence. To give a quantitative understanding of this problem, we conduct a theoretical sensitivity analysis of the attention output with respect to the source input. Grounded on our analysis, we propose a novel partial attention language model to solve the attention degeneration problem. Experimental results on machine translation, summarization, and data-to-text generation tasks support our analysis and demonstrate the effectiveness of our proposed model.

computational linguistic, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2304.04052

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
(24 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

tmn at SemEval-2023 Task 9: Multilingual Tweet Intimacy Detection using XLM-T, Google Translate, and Ensemble Learning

Glazkova, Anna

arXiv.org Artificial IntelligenceApr-8-2023

The paper describes a transformer-based system designed for SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis. The purpose of the task was to predict the intimacy of tweets in a range from 1 (not intimate at all) to 5 (very intimate). The official training set for the competition consisted of tweets in six languages (English, Spanish, Italian, Portuguese, French, and Chinese). The test set included the given six languages as well as external data with four languages not presented in the training set (Hindi, Arabic, Dutch, and Korean). We presented a solution based on an ensemble of XLM-T, a multilingual RoBERTa model adapted to the Twitter domain. To improve the performance of unseen languages, each tweet was supplemented by its English translation. We explored the effectiveness of translated data for the languages seen in fine-tuning compared to unseen languages and estimated strategies for using translated data in transformer-based models. Our solution ranked 4th on the leaderboard while achieving an overall Pearson's r of 0.599 over the test set. The proposed system improves up to 0.088 Pearson's r over a score averaged across all 45 submissions.

computational linguistic, dataset, pearson, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.semeval-1.183

2304.04054

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.05)
South America > Colombia > Meta Department > Villavicencio (0.04)
(4 more...)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Scotland - World University and School Wiki

#artificialintelligenceApr-7-2023, 19:45:20 GMT

Welcome to World University and School Wiki which anyone can add to or edit. See, too, the British Film Institute. If you were Scotland and heading for independence with a vote in the British Isles in 2014 or beyond, which currency would you choose for Scotland's long term prosperity, - institutional-wise, especially (e.g. "Like many Scots, I can clearly distinguish between independence and nationalism, and I certainly wouldn't be voting for nationalism, certainly not for tartan-la-la. Really I'd want a yes vote, then a bloodless coup the next morning, before there were any flags or triumphalism."

scotland, world university, worlduniversity, (14 more...)

#artificialintelligence

Country:

Europe > United Kingdom > England > Tyne and Wear > Newcastle (0.14)
Europe > United Kingdom > England > Greater Manchester > Manchester (0.14)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.07)
(11 more...)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Education > Educational Setting (1.00)
Government > Regional Government > Europe Government > United Kingdom Government > Scotland Government (0.33)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Collaboration (0.86)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)

Add feedback

JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques

Neural Information Processing SystemsApr-6-2023, 19:22:11 GMT

JANUS translates continuously spoken English and German into German, English, and Japanese. JANUS cur(cid:173) rently achieves 87% translation fidelity from English speech and 97% from German speech. We present the JANUS system along with com(cid:173) parative evaluations of its interchangeable processing components, with special emphasis on the connectionist modules.

connectionist and non-connectionist technique, janus, speech-to-speech translation, (2 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.19)
North America > United States > Minnesota > Hennepin County > Hopkins (0.12)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.65)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.48)

Add feedback

Learning from Multiple Partially Observed Views - an Application to Multilingual Text Categorization

Neural Information Processing SystemsApr-6-2023, 14:08:02 GMT

We address the problem of learning classifiers when observations have multiple views, some of which may not be observed for all examples. We assume the existence of view generating functions which may complete the missing views in an approximate way. This situation corresponds for example to learning text classifiers from multilingual collections where documents are not available in all languages. In that case, Machine Translation (MT) systems may be used to translate each document in the missing languages. We derive a generalization error bound for classifiers learned on examples with multiple artificially created views.

application, multilingual text categorization, observed view, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Preparing the Vuk'uzenzele and ZA-gov-multilingual South African multilingual corpora

Lastrucci, Richard, Dzingirai, Isheanesu, Rajab, Jenalea, Madodonga, Andani, Shingange, Matimba, Njini, Daniel, Marivate, Vukosi

arXiv.org Artificial IntelligenceApr-5-2023

This paper introduces two multilingual government themed corpora in various South African languages. The corpora were collected by gathering the South African Government newspaper (Vuk'uzenzele), as well as South African government speeches (ZA-gov-multilingual), that are translated into all 11 South African official languages. The corpora can be used for a myriad of downstream NLP tasks. The corpora were created to allow researchers to study the language used in South African government publications, with a focus on understanding how South African government officials communicate with their constituents. In this paper we highlight the process of gathering, cleaning and making available the corpora. We create parallel sentence corpora for Neural Machine Translation (NMT) tasks using Language-Agnostic Sentence Representations (LASER) embeddings. With these aligned sentences we then provide NMT benchmarks for 9 indigenous languages by fine-tuning a massively multilingual pre-trained language model.

artificial intelligence, machine translation, natural language, (15 more...)

arXiv.org Artificial Intelligence

2303.0375

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Dominican Republic (0.04)
Asia > Indonesia > Bali (0.04)
Africa > South Africa > Gauteng > Pretoria (0.04)

Genre: Research Report (0.82)

Industry: Government > Regional Government > Africa Government > South Africa Government (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Machine Translation from Signed to Spoken Languages: State of the Art and Challenges

De Coster, Mathieu, Shterionov, Dimitar, Van Herreweghe, Mieke, Dambre, Joni

arXiv.org Artificial IntelligenceApr-5-2023

Automatic translation from signed to spoken languages is an interdisciplinary research domain, lying on the intersection of computer vision, machine translation and linguistics. Nevertheless, research in this domain is performed mostly by computer scientists in isolation. As the domain is becoming increasingly popular - the majority of scientific papers on the topic of sign language translation have been published in the past three years - we provide an overview of the state of the art as well as some required background in the different related disciplines. We give a high-level introduction to sign language linguistics and machine translation to illustrate the requirements of automatic sign language translation. We present a systematic literature review to illustrate the state of the art in the domain and then, harking back to the requirements, lay out several challenges for future research. We find that significant advances have been made on the shoulders of spoken language machine translation research. However, current approaches are often not linguistically motivated or are not adapted to the different input modality of sign languages. We explore challenges related to the representation of sign language data, the collection of datasets, the need for interdisciplinary research and requirements for moving beyond research, towards applications. Based on our findings, we advocate for interdisciplinary research and to base future research on linguistic analysis of sign languages. Furthermore, the inclusion of deaf and hearing end users of sign language translation applications in use case identification, data collection and evaluation is of the utmost importance in the creation of useful sign language translation models. We recommend iterative, human-in-the-loop, design and development of sign language translation models.

machine learning, natural language, translation, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10209-023-00992-1

2202.03086

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.14)
Europe > Netherlands (0.04)
(7 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback