AITopics | Machine Translation

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

Transformers without Tears: Improving the Normalization of Self-Attention

Nguyen, Toan Q., Salazar, Julian

arXiv.org Machine Learning, Oct-13-2019

We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm the effectiveness of normalizing word embeddings to a fixed length (FixNorm). On five low-resource translation pairs from TED Talks-based corpora, these changes always converge, giving an average +1.1 BLEU over state-of-the-art bilingual baselines and a new 32.8 BLEU on IWSLT'15 English-Vietnamese. We observe sharper performance curves, more consistent gradient norms, and a linear relationship between activation scaling and decoder depth. Surprisingly, in the high-resource setting (WMT'14 English-German), ScaleNorm and FixNorm remain competitive but PreNorm degrades performance.
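As a rough illustration of two ideas named in this abstract, the sketch below implements $\ell_2$ normalization with a single scale parameter and a pre-norm residual connection in plain Python. The function names, the `eps` guard, and the toy identity sublayer are assumptions made for this example; this is not the authors' code.

```python
import math


def scale_norm(x, g, eps=1e-5):
    """Rescale vector x so its L2 norm equals g.

    g is the single scale parameter (learned in practice, fixed here);
    eps is an assumed guard against division by zero.
    """
    norm = math.sqrt(sum(v * v for v in x))
    return [g * v / max(norm, eps) for v in x]


def pre_norm_residual(x, sublayer, g):
    """Pre-norm residual: normalize the input first, then add the skip path."""
    normed = scale_norm(x, g)
    return [a + b for a, b in zip(x, sublayer(normed))]


# Toy usage: an identity function stands in for attention or feed-forward.
y = scale_norm([3.0, 4.0], g=2.0)
z = pre_norm_residual([3.0, 4.0], lambda v: v, g=2.0)
```

Whatever the magnitude of the input, the normalized vector has norm `g`, so one scalar controls the scale of the activations fed to the sublayer.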

1910.05895

Country: Europe > Czechia > Prague (0.04)

Genre: Research Report (1.00)

Industry: Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Using Neural Machine Translation for Multilingual Communication

#artificialintelligence, Oct-11-2019, 15:52:25 GMT

A new type of Artificial Intelligence (AI) technology, called Neural Machine Translation (NMT), is quickly earning the attention of multilingual communities. This software is helping to expedite the translation process and has the potential to open government information to more non-English languages. In this session, Beth Flaherty will give a high-level overview of machine translation technology. We will discuss the evolution of machine translation (MT), how MT is used in the government, ways to "specialize" a language engine to a specific domain, calculation of return on investment (ROI), and the road ahead. We'll also show a live demo of the NMT software so that the audience can see the flexibility of use with this technology.

beth flaherty, multilingual communication, neural machine translation, (2 more...)

Country: North America > United States > District of Columbia > Washington (0.07)

Industry: Government (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels

Jing, Yimin, Xiong, Deyi, Zhen, Yan

arXiv.org Artificial Intelligence, Oct-11-2019

This paper presents BiPaR, a bilingual parallel novel-style machine reading comprehension (MRC) dataset, developed to support multilingual and cross-lingual reading comprehension. The biggest difference between BiPaR and existing reading comprehension datasets is that each triple (Passage, Question, Answer) in BiPaR is written parallelly in two languages. We collect 3,667 bilingual parallel paragraphs from Chinese and English novels, from which we construct 14,668 parallel question-answer pairs via crowdsourced workers following a strict quality control procedure. We analyze BiPaR in depth and find that BiPaR offers good diversification in prefixes of questions, answer types and relationships between questions and passages. We also observe that answering questions of novels requires reading comprehension skills of coreference resolution, multi-sentence reasoning, and understanding of implicit causality, etc. With BiPaR, we build monolingual, multilingual, and cross-lingual MRC baseline models. Even for the relatively simple monolingual MRC on this dataset, experiments show that a strong BERT baseline is over 30 points behind human in terms of both EM and F1 score, indicating that BiPaR provides a challenging testbed for monolingual, multilingual and cross-lingual MRC on novels. The dataset is available at https://multinlp.github.io/BiPaR/.
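The EM and F1 scores mentioned in this abstract are commonly computed per answer as exact string match and token-overlap F1. The sketch below shows one such formulation; the lowercasing and whitespace tokenization are assumptions for illustration (whitespace splitting would not suit Chinese text), and the dataset's official scoring may differ.

```python
from collections import Counter


def exact_match(prediction, reference):
    """1.0 if the two answers are identical after trimming and lowercasing."""
    return float(prediction.strip().lower() == reference.strip().lower())


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


em = exact_match("the old man", "old man")
f1 = token_f1("the old man", "old man")
```

A prediction with one extra word fails exact match outright but still earns partial F1 credit, which is why the two metrics are usually reported together.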

artificial intelligence, machine learning, natural language, (15 more...)

1910.05040

Country:

Asia > China (0.04)
North America > Canada (0.04)
Europe > Italy > Liguria > Genoa (0.04)
Europe > Greece > Ionian Islands > Corfu (0.04)

Genre: Research Report (1.00)

Industry: Education > Assessment & Standards > Student Performance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Artificial Intelligence and how they are empowering Search for Mobile, Web Apps - Ongraph

#artificialintelligence, Oct-10-2019, 12:32:33 GMT

Google Translate is one of Google's popular and highly useful products. It is based on artificial intelligence (AI) algorithms, and Google is constantly improving the translation application with AI. The company has built Neural Machine Translation into Google Translate, which has radically improved results. The company's AI team calls it the Google Neural Machine Translation System (GNMT).

artificial intelligence, google translate, ongraph, (1 more...)

Industry: Information Technology > Software (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Language Transfer for Early Warning of Epidemics from Social Media

Appelgren, Mattias, Schrempf, Patrick, Falis, Matúš, Ikeda, Satoshi, O'Neil, Alison Q

arXiv.org Artificial Intelligence, Oct-10-2019

Statements on social media can be analysed to identify individuals who are experiencing red flag medical symptoms, allowing early detection of the spread of disease such as influenza. Since disease does not respect cultural borders and may spread between populations speaking different languages, we would like to build multilingual models. However, the data required to train models for every language may be difficult, expensive and time-consuming to obtain, particularly for low-resource languages. Taking Japanese as our target language, we explore methods by which data in one language might be used to build models for a different language. We evaluate strategies of training on machine translated data and of zero-shot transfer through the use of multilingual models. We find that the choice of source language impacts the performance, with Chinese-Japanese being a better language pair than English-Japanese. Training on machine translated data shows promise, especially when used in conjunction with a small amount of target language data.

experiment, mbert, translation, (13 more...)

1910.04519

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
Health & Medicine > Therapeutic Area > Immunology (0.72)

Technology:

Information Technology > Communications > Social Media (0.86)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Straker Translations on Twitter

#artificialintelligence, Oct-9-2019, 22:58:40 GMT

The Japanese language is wonderfully unique, complex & can be one of the hardest languages to learn. So how well does machine translation handle the Japanese language? Have a read of our latest blog to find out.

japanese language, straker translation, twitter

Country: Asia > Japan (0.23)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.89)

Machine Learning Intern (Summer 2020) ai-jobs.net

#artificialintelligence, Oct-8-2019, 22:51:38 GMT

Mozilla is hiring a Machine Learning Intern for our Emerging Technologies team. Emerging Technologies is Mozilla's early research and development organization focused on the areas of voice assistants, speech and language, and mixed reality. Our headquarters are based in the Bay Area, but this internship opportunity is at our Berlin Office. We are engineers, designers, makers, and problem solvers. We work in the fishbowl known as the open source community, with a clear focus on making the Web better.

machine learning intern, mozilla, summer 2020, (2 more...)

Country: Europe > Germany > Berlin (0.07)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.40)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.36)

Why 85% of AI projects fail

#artificialintelligence, Oct-7-2019, 17:37:21 GMT

Despite increased interest in and adoption of artificial intelligence (AI) in the enterprise, 85% of AI projects ultimately fail to deliver on their intended promises to business, according to a Thursday report from Pactera Technologies. A major source of AI challenges is found in senior leadership, the report, titled Artificial Intelligence Localization, Winners, Losers, Heroes, Spectators, and You, found. Some 77% of those surveyed said they face barriers to entry from senior management not seeing value or wanting to make the investment in the emerging technology. These findings are in line with those from a recent Dimensional Research report, which found that eight out of 10 organizations engaged with AI and machine learning said those projects had stalled, and 96% said they have run into problems with data quality, data labelling, and building model confidence. Pactera presented the report to a group of tech industry leaders including those from Facebook, Adobe, Amazon, and Microsoft at a recent private event in Seattle.

ai project fail, neural machine translation, techrepublic, (4 more...)

Genre: Research Report (0.75)

Industry: Information Technology (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.57)
Information Technology > Communications > Social Media (0.49)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.42)

MLPerf Training Benchmark

Mattson, Peter, Cheng, Christine, Coleman, Cody, Diamos, Greg, Micikevicius, Paulius, Patterson, David, Tang, Hanlin, Wei, Gu-Yeon, Bailis, Peter, Bittorf, Victor, Brooks, David, Chen, Dehao, Dutta, Debojyoti, Gupta, Udit, Hazelwood, Kim, Hock, Andrew, Huang, Xinyuan, Jia, Bill, Kang, Daniel, Kanter, David, Kumar, Naveen, Liao, Jeffery, Narayanan, Deepak, Oguntebi, Tayo, Pekhimenko, Gennady, Pentecost, Lillian, Reddi, Vijay Janapa, Robie, Taylor, St. John, Tom, Wu, Carole-Jean, Xu, Lingjie, Young, Cliff, Zaharia, Matei

arXiv.org Machine Learning, Oct-2-2019

Machine learning is experiencing an explosion of software and hardware solutions, and needs industry-standard performance benchmarks to drive design and enable competitive evaluation. However, machine learning training presents a number of unique challenges to benchmarking that do not exist in other domains: (1) some optimizations that improve training throughput actually increase time to solution, (2) training is stochastic and time to solution has high variance, and (3) the software and hardware systems are so diverse that they cannot be fairly benchmarked with the same binary, code, or even hyperparameters. We present MLPerf, a machine learning benchmark that overcomes these challenges. We quantitatively evaluate the efficacy of MLPerf in driving community progress on performance and scalability across two rounds of results from multiple vendors.
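Challenge (1) in this abstract can be made concrete with a little arithmetic: a configuration that processes samples faster may still need more passes over the data to reach the same quality. The sketch below uses entirely hypothetical numbers and a hypothetical helper name; none of it comes from the MLPerf results.

```python
def time_to_solution(samples_per_epoch, epochs_to_target, samples_per_second):
    """Seconds of training needed to reach the target quality."""
    return samples_per_epoch * epochs_to_target / samples_per_second


# Hypothetical small-batch run: lower throughput, fewer epochs to converge.
small_batch = time_to_solution(1_000_000, epochs_to_target=10, samples_per_second=1_000)

# Hypothetical large-batch run: 50% higher throughput, twice as many epochs.
large_batch = time_to_solution(1_000_000, epochs_to_target=20, samples_per_second=1_500)

slower_despite_throughput = large_batch > small_batch
```

Under these assumed numbers the higher-throughput run takes about a third longer to reach the target, which is why a benchmark that measures time to solution can rank systems differently from one that measures raw throughput.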

benchmark, machine learning, natural language, (18 more...)

1910.01500

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment > Games (0.68)
Information Technology > Software (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Murray, Kenton, Kinnison, Jeffery, Nguyen, Toan Q., Scheirer, Walter, Chiang, David

arXiv.org Machine Learning, Oct-1-2019

Neural sequence-to-sequence models, particularly the Transformer, are the state of the art in machine translation. Yet these neural networks are very sensitive to architecture and hyper-parameter settings. Optimizing these settings by grid or random search is computationally expensive because it requires many training runs. In this paper, we incorporate architecture search into a single training run through auto-sizing, which uses regularization to delete neurons in a network over the course of training. On very low-resource language pairs, we show that auto-sizing can improve BLEU scores by up to 3.9 points while removing one-third of the parameters from the model.
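One way regularization can delete neurons is a group penalty applied through a proximal step, which shrinks each neuron's weight row and sets weak rows exactly to zero. The sketch below assumes a group $\ell_2$ penalty and uses hypothetical function names; the abstract does not state which regularizer or update rule the authors use, so treat this as an illustration of the general idea only.

```python
import math


def prox_group_l2(weights, threshold):
    """Proximal step for a group L2 penalty, one group per row (neuron).

    Each row is scaled by max(0, 1 - threshold / ||row||), so any row whose
    norm is at or below the threshold becomes all zeros.
    """
    shrunk = []
    for row in weights:
        norm = math.sqrt(sum(w * w for w in row))
        scale = max(0.0, 1.0 - threshold / norm) if norm > 0 else 0.0
        shrunk.append([scale * w for w in row])
    return shrunk


def surviving_neurons(weights):
    """Indices of rows that still have at least one nonzero weight."""
    return [i for i, row in enumerate(weights) if any(w != 0.0 for w in row)]


# Toy layer with one strong neuron and one weak neuron.
layer = [[3.0, 4.0], [0.3, 0.4]]
pruned = prox_group_l2(layer, threshold=1.0)
kept = surviving_neurons(pruned)
```

Applied after each gradient update, a step like this lets the network size shrink during a single training run, since rows that reach zero can be dropped from the layer.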

bleu score, language pair, transformer, (15 more...)

1910.06717

Country:

North America > United States > California (0.14)
Europe > Czechia > Prague (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
