AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Neural Machine Translation System of Indic Languages -- An Attention based Approach

Shah, Parth, Bakrola, Vishvajit

arXiv.org Machine LearningFeb-2-2020

Neural machine translation (NMT) is a recent and effective technique which led to remarkable improvements in comparison of conventional machine translation techniques. Proposed neural machine translation model developed for the Gujarati language contains encoder-decoder with attention mechanism. In India, almost all the languages are originated from their ancestral language - Sanskrit. They are having inevitable similarities including lexical and named entity similarity. Translating into Indic languages is always be a challenging task. In this paper, we have presented the neural machine translation system (NMT) that can efficiently translate Indic languages like Hindi and Gujarati that together covers more than 58.49 percentage of total speakers in the country. We have compared the performance of our NMT model with automatic evaluation matrices such as BLEU, perplexity and TER matrix. The comparison of our network with Google translate is also presented where it outperformed with a margin of 6 BLEU score on English-Gujarati translation.

machine translation, translation, translation system, (13 more...)

arXiv.org Machine Learning

doi: 10.1109/ICACCP.2019.8882969

2002.02758

Country:

Asia > India (0.27)
Asia > Singapore (0.05)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
(2 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

Bertrand-DR: Improving Text-to-SQL using a Discriminative Re-ranker

Kelkar, Amol, Relan, Rohan, Bhardwaj, Vaishali, Vaichal, Saurabh, Relan, Peter

arXiv.org Machine LearningFeb-2-2020

To access data stored in relational databases, users need to understand the database schema and write a query using a query language such as SQL. To simplify this task, text-to-SQL models attempt to translate a user's natural language question to corresponding SQL query. Recently, several generative text-to-SQL models have been developed. We propose a novel discriminative re-ranker to improve the performance of generative text-to-SQL models by extracting the best SQL query from the beam output predicted by the text-to-SQL generator, resulting in improved performance in the cases where the best query was in the candidate list, but not at the top of the list. We build the re-ranker as a schema agnostic BERT fine-tuned classifier. We analyze relative strengths of the text-to-SQL and re-ranker models across different query hardness levels, and suggest how to combine the two models for optimal performance. We demonstrate the effectiveness of the re-ranker by applying it to two state-of-the-art text-to-SQL models, and achieve top 4 score on the Spider leaderboard at the time of writing this article.

bertrand-dr, query, text-to-sql model, (14 more...)

arXiv.org Machine Learning

2002.00557

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

Word Sense Disambiguation

#artificialintelligenceFeb-1-2020, 17:36:13 GMT

The history and development of Artificial Intelligence has seen numerous peaks and troughs. Hype around what machines can accomplish lead to boosts in AI funding while unmet expectations cripple the industry until the next breakthrough. The term AI Winter refers to periods in history of reduced funding and interest in artificial intelligence development. During the cold war, there was an increased interest in Machine Translation to automate the translation of Russian documents into English. This time period also coincided with massive strides in linguistic developments and the early career of the famed linguist Noam Chomsky.

machine translation, translation, word sense disambiguation, (6 more...)

#artificialintelligence

Industry: Government > Military (0.37)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Massively Multilingual Document Alignment with Cross-lingual Sentence-Mover's Distance

El-Kishky, Ahmed, Guzmán, Francisco

arXiv.org Machine LearningJan-31-2020

Cross-lingual document alignment aims to identify pairs of documents in two distinct languages that are of comparable content or translations of each other. Such aligned data can be used for a variety of NLP tasks from training cross-lingual representations to mining parallel bitexts for machine translation training. In this paper we develop an unsupervised scoring function that leverages cross-lingual sentence embeddings to compute the semantic distance between documents in different languages. These semantic distances are then used to guide a document alignment algorithm to properly pair cross-lingual web documents across a variety of low, mid, and high-resource language pairs. Recognizing that our proposed scoring function and other state of the art methods are computationally intractable for long web documents, we utilize a more tractable greedy algorithm that performs comparably. We experimentally demonstrate that our distance metric performs better alignment than current baselines outperforming them by 7% on high-resource language pairs, 15% on mid-resource language pairs, and 22% on low-resource language pairs

document pair, representation, target document, (15 more...)

arXiv.org Machine Learning

2002.00761

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > California > San Mateo County > Menlo Park (0.04)
North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
(2 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.90)

Add feedback

AMR Similarity Metrics from Principles

Opitz, Juri, Parcalabescu, Letitia, Frank, Anette

arXiv.org Artificial IntelligenceJan-29-2020

Different metrics have been proposed to compare Abstract Meaning Representation (AMR) graphs. The canonical Smatch metric (Cai and Knight, 2013) aligns variables from one graph to another and compares the matching triples. The recently released SemBleu metric (Song and Gildea, 2019) is based on the machine-translation metric Bleu (Papineni et al., 2002), increasing computational efficiency by ablating a variable-alignment step and aiming at capturing more global graph properties. Our aims are threefold: i) we establish criteria that allow us to perform a principled comparison between metrics of symbolic meaning representations like AMR; ii) we undertake a thorough analysis of Smatch and SemBleu where we show that the latter exhibits some undesirable properties. E.g., it violates the identity of indiscernibles rule and introduces biases that are hard to control; iii) we propose a novel metric S2match that is more benevolent to only very slight meaning deviations and targets the fulfilment of all established criteria. We assess its suitability and show its advantages over Smatch and SemBleu.

amr graph, emb leu, graph, (12 more...)

arXiv.org Artificial Intelligence

2001.10929

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Maryland > Baltimore (0.04)
North America > Canada (0.04)
(15 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Unsupervised Multilingual Alignment using Wasserstein Barycenter

Lian, Xin, Jain, Kshitij, Truszkowski, Jakub, Poupart, Pascal, Yu, Yaoliang

arXiv.org Machine LearningJan-28-2020

We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly chosen pivot language (such as English) may severely degrade the translation quality, since the assumed transitive relations among all pairs of languages may not be enforced in the training process. Instead of going through a rather arbitrarily chosen pivot language, we propose to use the Wasserstein barycenter as a more informative ''mean'' language: it encapsulates information from all languages and minimizes all pairwise transportation costs. We evaluate our method on standard benchmarks and demonstrate state-of-the-art performances.

alignment, barycenter, translation, (12 more...)

arXiv.org Machine Learning

2002.00743

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Generating Representative Headlines for News Stories

Gu, Xiaotao, Mao, Yuning, Han, Jiawei, Liu, Jialu, Yu, Hongkun, Wu, You, Yu, Cong, Finnie, Daniel, Zhai, Jiaqi, Zukoski, Nicholas

arXiv.org Artificial IntelligenceJan-27-2020

Millions of news articles are published online every day, which can be overwhelming for readers to follow. Grouping articles that are reporting the same event into news stories is a common way of assisting readers in their news consumption. However, it remains a challenging research problem to efficiently and effectively generate a representative headline for each story. Automatic summarization of a document set has been studied for decades, while few studies have focused on generating representative headlines for a set of articles. Unlike summaries, which aim to capture most information with least redundancy, headlines aim to capture information jointly shared by the story articles in short length, and exclude information that is too specific to each individual article. In this work, we study the problem of generating representative headlines for news stories. We develop a distant supervision approach to train large-scale generation models without any human annotation. This approach centers on two technical components. First, we propose a multi-level pre-training framework that incorporates massive unlabeled corpus with different quality-vs.-quantity balance at different levels. We show that models trained within this framework outperform those trained with pure human curated corpus. Second, we propose a novel self-voting-based article attention layer to extract salient information shared by multiple articles. We show that models that incorporate this layer are robust to potential noises in news stories and outperform existing baselines with or without noises. We can further enhance our model by incorporating human labels, and we show our distant supervision approach significantly reduces the demand on labeled data.

computational linguistic, news story, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2001.09386

Country:

North America > United States > Alabama (0.05)
Asia > Taiwan > Taiwan Province > Taipei (0.05)
Europe > Italy > Tuscany > Florence (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Leisure & Entertainment > Sports > Baseball (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.67)

Add feedback

Otter.ai expands in Japan in partnership with NTT DOCOMO

#artificialintelligenceJan-24-2020, 10:15:14 GMT

Otter.ai to Bring AI-Powered Meeting Note Collaboration Service to Japan in partnership with NTT DOCOMO Partnership includes Investment and Customer Trials of Otter's Real-Time Transcription Los Altos, CA, January 23, 2020 –Otter.ai DOCOMO made a strategic investment in Otter through its wholly-owned subsidiary NTT DOCOMO Ventures, Inc. and announced plans for its AI-based translation service subsidiary to integrate Otter's meeting note collaboration into its offering to provide highly accurate English transcripts translated into Japanese. As a part of Otter's customer engagement with DOCOMO the Otter Voice Meeting Notes application is being used on a trial basis in Berlitz Corporation's English language classes in Japan. Students use Otter to transcribe and review the content of lessons, click on sections of text, and initiate voice playback. DOCOMO, Otter.ai and Berlitz are expanding their collaboration in language education to verify Otter's effectiveness in the study of English DOCOMO is featuring Otter during demonstrations at the DOCOMO Open House 2020, taking place in the Tokyo Big Sight exhibition complex January 23 and 24, 2020.

docomo, otter, partnership, (7 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Los Altos (0.26)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.26)

Genre: Press Release (0.38)

Industry:

Telecommunications (1.00)
Information Technology > Services (0.98)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.33)

Add feedback

Semi-Autoregressive Training Improves Mask-Predict Decoding

Ghazvininejad, Marjan, Levy, Omer, Zettlemoyer, Luke

arXiv.org Machine LearningJan-23-2020

The recently proposed mask-predict decoding algorithm has narrowed the performance gap between semi-autoregressive machine translation models and the traditional left-to-right approach. We introduce a new training method for conditional masked language models, SMART, which mimics the semi-autoregressive behavior of mask-predict, producing training examples that contain model predictions as part of their inputs. Models trained with SMART produce higher-quality translations when using mask-predict decoding, effectively closing the remaining performance gap with fully autoregressive models.

iteration, training example, translation, (16 more...)

arXiv.org Machine Learning

2001.08785

Genre: Research Report (0.40)

Industry: Materials > Metals & Mining > Gold (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.64)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.53)

Add feedback

Can Simple Neuron Interactions Capture Complex Linguistic Phenomena?

#artificialintelligenceJan-19-2020, 13:12:38 GMT

Deep neural machine translation (NMT) can learn representations containing linguistic information. And despite the differences between various models, they all tend to learn similar properties. This phenomena got researchers wondering whether the learned information is fully distributed and embedded to individual neurons. Recent research results confirmed that hypothesis, revealing that simple properties such as coordinating conjunctions and determiners can be attributed to individual neurons, while more complex linguistic properties such as syntax and semantics are distributed across multiple neurons. Following on this, researchers from The Chinese University of Hong Kong, Tencent AI Lab and University of Macau have proposed a new neuron interaction based representation composition for NMT.

interaction, interaction capture complex linguistic phenomenon, representation composition, (7 more...)

#artificialintelligence

Country:

Asia > Macao (0.27)
Asia > China > Hong Kong (0.27)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback