AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Compositional generalization in a deep seq2seq model by separating syntax and semantics

Russin, Jake, Jo, Jason, O'Reilly, Randall C., Bengio, Yoshua

arXiv.org Machine LearningApr-26-2019

Standard methods in deep learning for natural language processing fail to capture the compositional structure of human language that allows for systematic generalization outside of the training distribution. However, human learners readily generalize in this way, e.g. by applying known grammatical rules to novel words. Inspired by work in neuroscience suggesting separate brain systems for syntactic and semantic processing, we implement a modification to standard approaches in neural machine translation, imposing an analogous separation. The novel model, which we call Syntactic Attention, substantially outperforms standard methods in deep learning on the SCAN dataset, a compositional generalization task, without any hand-engineered features or additional supervision. Our work suggests that separating syntactic from semantic learning may be a useful heuristic for capturing compositional structure.

information, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

1904.09708

Country: North America > United States > Colorado (0.14)

Genre: Research Report (0.70)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)

Add feedback

Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications

Wang, Chenglong, Bunel, Rudy, Dvijotham, Krishnamurthy, Huang, Po-Sen, Grefenstette, Edward, Kohli, Pushmeet

arXiv.org Artificial IntelligenceApr-26-2019

Models such as Sequence-to-Sequence and Image-to-Sequence are widely used in real world applications. While the ability of these neural architectures to produce variable-length outputs makes them extremely effective for problems like Machine Translation and Image Captioning, it also leaves them vulnerable to failures of the form where the model produces outputs of undesirable length. This behavior can have severe consequences such as usage of increased computation and induce faults in downstream modules that expect outputs of a certain length. Motivated by the need to have a better understanding of the failures of these models, this paper proposes and studies the novel output-size modulation problem and makes two key technical contributions. First, to evaluate model robustness, we develop an easy-to-compute differentiable proxy objective that can be used with gradient-based algorithms to find output-lengthening inputs. Second and more importantly, we develop a verification approach that can formally verify whether a network always produces outputs within a certain length. Experimental results on Machine Translation and Image Captioning show that our output-lengthening approach can produce outputs that are 50 times longer than the input, while our verification approach can, given a model and input domain, prove that the output length is below a certain size.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1904.12004

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.94)
Information Technology > Artificial Intelligence > Vision (0.88)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.87)

Add feedback

Artificial Intelligence is Deciphering the World's Oldest Writings

#artificialintelligenceApr-22-2019, 00:01:48 GMT

Scientists are constantly figuring out how to expand the field of use of this incredible invention, which enables computer software to progressively improve its actions by adopting knowledge gained from previous experience. Machine learning, also referred to as artificial intelligence due to its ability to perform tasks using its own judgment, has been the subject of both praise and controversy. However, the sophisticated algorithms that have served in providing you ads on social networks might have a grand future in philology, archaeology, and linguistics. According to Émilie Pagé-Perron, a Ph.D. candidate in Assyriology at the University of Toronto, we might be closer than we thought to deciphering numerous Middle-Eastern cuneiform tablets written in Sumerian and Akkadian languages, all of which are several thousand years old. Pagé-Perron is in charge of the project officially titled Machine Translation and Automated Analysis of Cuneiform Languages, which currently operates in Frankfurt, Toronto, and Los Angeles, using combined efforts to create a program capable of translating the clay tablets.

artificial intelligence, machine translation, natural language, (11 more...)

#artificialintelligence

Country:

North America > Canada > Ontario > Toronto (0.80)
North America > United States > California > Los Angeles County > Los Angeles (0.26)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.40)

Add feedback

Unsupervised Text Generation from Structured Data

Schmitt, Martin, Schütze, Hinrich

arXiv.org Artificial IntelligenceApr-20-2019

This work presents a joint solution to two challenging tasks: text generation from data and open information extraction. We propose to model both tasks as sequence-to-sequence translation problems and thus construct a joint neural model for both. Our experiments on knowledge graphs from Visual Genome, i.e., structured image analyses, shows promising results compared to strong baselines. Building on recent work on unsupervised machine translation, we report the first results - to the best of our knowledge - on fully unsupervised text generation from structured data.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1904.09447

Country:

Europe (0.94)
North America > United States > New Mexico (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Constant-Time Machine Translation with Conditional Masked Language Models

Ghazvininejad, Marjan, Levy, Omer, Liu, Yinhan, Zettlemoyer, Luke

arXiv.org Artificial IntelligenceApr-19-2019

Most machine translation systems generate text autoregressively, by sequentially predicting tokens from left to right. We, instead, use a masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation. This approach allows for efficient iterative decoding, where we first predict all of the target words non-autoregressively, and then repeatedly mask out and regenerate the subset of words that the model is least confident about. By applying this strategy for a constant number of iterations, our model improves state-of-the-art performance levels for constant-time translation models by over 3 BLEU on average. It is also able to reach 92-95% of the performance of a typical left-to-right transformer model, while decoding significantly faster.

artificial intelligence, iteration, natural language, (17 more...)

arXiv.org Artificial Intelligence

1904.09324

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

r/MachineLearning - [D] Translating text from Portuguese to English - unexpected funny result

#artificialintelligenceApr-17-2019, 10:48:37 GMT

While following the guide on How to Develop a Neural Machine Translation System from Scratch, I'm trying to create a translation system for Portuguese-English.

artificial intelligence, natural language, unexpected funny result, (4 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Google has opened its first Africa Artificial Intelligence lab in Ghana

#artificialintelligenceApr-16-2019, 14:03:57 GMT

In seconds she gets a diagnosis of the disease affecting her plant and how best to manage it to boost her production. The farmer used an app on her phone based on TensorFlow, Google's Artificial Intelligence (AI) machine that the company opensourced to help developers create solutions to real-world problems. When people think of Artificial Intelligence, they most likely think of scenes from science fiction movies, but in reality, it applies to everyday life from virtual assistants to language translation on Google, says John Quinn, an AI researcher. Google now wants to position itself as an "AI first" company and with research centers across the globe in places such as Tokyo, Zurich, New York, and Paris. And last week, the technology company opened its first center in Africa in Ghana's capital city, Accra.

artificial intelligence, machine translation, natural language, (14 more...)

#artificialintelligence

Country:

Africa > Ghana > Greater Accra > Accra (0.63)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.25)

Industry: Information Technology > Services (0.53)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.36)

Add feedback

Corpora Generation for Grammatical Error Correction

Lichtarge, Jared, Alberti, Chris, Kumar, Shankar, Shazeer, Noam, Parmar, Niki, Tong, Simon

arXiv.org Machine LearningApr-10-2019

Grammatical Error Correction (GEC) has been recently modeled using the sequence-to-sequence framework. However, unlike sequence transduction problems such as machine translation, GEC suffers from the lack of plentiful parallel data. We describe two approaches for generating large parallel datasets for GEC using publicly available Wikipedia data. The first method extracts source-target pairs from Wikipedia edit histories with minimal filtration heuristics, while the second method introduces noise into Wikipedia sentences via round-trip translation through bridge languages. Both strategies yield similar sized parallel corpora containing around 4B tokens. We employ an iterative decoding strategy that is tailored to the loosely supervised nature of our constructed corpora. We demonstrate that neural GEC models trained using either type of corpora give similar performance. Fine-tuning these models on the Lang-8 corpus and ensembling allows us to surpass the state of the art on both the CoNLL-2014 benchmark and the JFLEG task. We provide systematic analysis that compares the two approaches to data generation and highlights the effectiveness of ensembling.

artificial intelligence, natural language, translation, (18 more...)

arXiv.org Machine Learning

1904.0578

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Membership Inference Attacks on Sequence-to-Sequence Models

Hisamoto, Sorami, Post, Matt, Duh, Kevin

arXiv.org Machine LearningApr-10-2019

Data privacy is an important issue for "machine learning as a service" providers. We focus on the problem of membership inference attacks: given a data sample and black-box access to a model's API, determine whether the sample existed in the model's training data. Our contribution is an investigation of this problem in the context of sequence-to-sequence models, which are important in applications such as machine translation and video captioning. We define the membership inference problem for sequence generation, provide an open dataset based on state-of-the-art machine translation models, and report initial results on whether these models leak private information against several kinds of membership inference attacks.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1904.05506

Country:

Europe (1.00)
North America > United States (0.68)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Consistency by Agreement in Zero-shot Neural Machine Translation

Al-Shedivat, Maruan, Parikh, Ankur P.

arXiv.org Machine LearningApr-10-2019

Generalization and reliability of multilingual translation often highly depend on the amount of available parallel data for each language pair of interest. In this paper, we focus on zero-shot generalization---a challenging setup that tests models on translation directions they have not been optimized for at training time. To solve the problem, we (i) reformulate multilingual translation as probabilistic inference, (ii) define the notion of zero-shot consistency and show why standard training often results in models unsuitable for zero-shot tasks, and (iii) introduce a consistent agreement-based training method that encourages the model to produce equivalent translations of parallel sentences in auxiliary languages. We test our multilingual NMT models on multiple public zero-shot translation benchmarks (IWSLT17, UN corpus, Europarl) and show that agreement-based learning often results in 2-3 BLEU zero-shot improvement over strong baselines without any loss in performance on supervised translation directions.

large language model, machine learning, translation, (20 more...)

arXiv.org Machine Learning

1904.02338

Country:

North America > United States (0.28)
Asia > Japan > Honshū (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback