AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

ACL 2019 Best Papers Announced

#artificialintelligenceAug-1-2019, 16:29:22 GMT

The Association for Computational Linguistics (ACL) held its 57th annual meeting July 28 to August 2 in Florence, Italy. Today, the ACL 2019 organizing committee announced its eight paper awards: Best Long Paper, Best Short Paper, Best Demo Paper, and five Outstanding Paper awards. The paper addresses the issue by sampling context words both from the ground truth sequence and the predicted sequence by a model during training. Researchers tested the approach on Chinese to English and WMT'14 English to German translation tasks, and achieved significant improvements on various datasets. Click here to read the full paper.

artificial intelligence, machine translation, natural language, (11 more...)

#artificialintelligence

Country:

Europe > Italy > Tuscany > Florence (0.26)
North America > United States > Washington > King County > Seattle (0.17)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.52)

Add feedback

Bilingual Lexicon Induction through Unsupervised Machine Translation

Artetxe, Mikel, Labaka, Gorka, Agirre, Eneko

arXiv.org Artificial IntelligenceJul-24-2019

A recent research line has obtained strong results on bilingual lexicon induction by aligning independently trained word embeddings in two languages and using the resulting cross-lingual embeddings to induce word translation pairs through nearest neighbor or related retrieval methods. In this paper, we propose an alternative approach to this problem that builds on the recent work on unsupervised machine translation. This way, instead of directly inducing a bilingual lexicon from cross-lingual embeddings, we use them to build a phrase-table, combine it with a language model, and use the resulting machine translation system to generate a synthetic parallel corpus, from which we extract the bilingual lexicon using statistical word alignment techniques. As such, our method can work with any word embedding and cross-lingual mapping technique, and it does not require any additional resource besides the monolingual corpus used to train the embeddings. When evaluated on the exact same cross-lingual embeddings, our proposed method obtains an average improvement of 6 accuracy points over nearest neighbor and 4 points over CSLS retrieval, establishing a new state-of-the-art in the standard MUSE dataset.

artificial intelligence, computational linguistic, natural language, (13 more...)

arXiv.org Artificial Intelligence

1907.10761

Country:

Europe (0.71)
North America > United States (0.68)

Genre: Research Report (0.83)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Microsoft Unveiled a New Language Translation Feature for Its HoloLens Holograms Digital Trends

#artificialintelligenceJul-20-2019, 20:37:01 GMT

Not only is it possible to have a fairly realistic holographic replica of yourself, but Microsoft has just shown that it is also possible to have that same replica speak in different languages, too. According to The Verge, on Wednesday, July 17, Microsoft provided a demo of this latest innovation during its keynote speech at the Microsoft Inspire partner conference in Las Vegas. Tom Warren of The Verge posted a video clip on YouTube of Microsoft's demonstration of the hologram's language translation capabilities. Microsoft's demonstration of the technology included Azure executive Julia White, a HoloLens 2 headset, and White's hologram. White's hologram began as a small green outline of a hologram that White could hold in her hand, but as soon as she uttered two simple words, "render keynote," the small hologram grew into a fully rendered, human-sized replica of White and immediately began delivering the keynote speech in Japanese, in a voice that still matched White's.

artificial intelligence, microsoft, natural language, (11 more...)

#artificialintelligence

Country: North America > United States > Nevada > Clark County > Las Vegas (0.27)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.65)

Add feedback

Lookahead Optimizer: k steps forward, 1 step back

Zhang, Michael R., Lucas, James, Hinton, Geoffrey, Ba, Jimmy

arXiv.org Machine LearningJul-19-2019

The vast majority of successful deep neural networks are trained using variants of stochastic gradient descent (SGD) algorithms. Recent attempts to improve SGD can be broadly categorized into two approaches: (1) adaptive learning rate schemes, such as AdaGrad and Adam, and (2) accelerated schemes, such as heavy-ball and Nesterov momentum. In this paper, we propose a new optimization algorithm, Lookahead, that is orthogonal to these previous approaches and iteratively updates two sets of weights. Intuitively, the algorithm chooses a search direction by \emph{looking ahead} at the sequence of "fast weights" generated by another optimizer. We show that Lookahead improves the learning stability and lowers the variance of its inner optimizer with negligible computation and memory cost. We empirically demonstrate Lookahead can significantly improve the performance of SGD and Adam, even with their default hyperparameter settings on ImageNet, CIFAR-10/100, neural machine translation, and Penn Treebank.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

1907.0861

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.90)

Add feedback

Hierarchical Sequence to Sequence Voice Conversion with Limited Data

Narayanan, Praveen, Chakravarty, Punarjay, Charette, Francois, Puskorius, Gint

arXiv.org Machine LearningJul-15-2019

We present a voice conversion solution using recurrent sequence to sequence modeling for DNNs. Our solution takes advantage of recent advances in attention based modeling in the fields of Neural Machine Translation (NMT), Text-to-Speech (TTS) and Automatic Speech Recognition (ASR). The problem consists of converting between voices in a parallel setting when {\it $<$source,target$>$} audio pairs are available. Our seq2seq architecture makes use of a hierarchical encoder to summarize input audio frames. On the decoder side, we use an attention based architecture used in recent TTS works. Since there is a dearth of large multispeaker voice conversion databases needed for training DNNs, we resort to training the network with a large single speaker dataset as an autoencoder. This is then adapted for the smaller multispeaker voice conversion datasets available for voice conversion. In contrast with other voice conversion works that use $F_0$, duration and linguistic features, our system uses mel spectrograms as the audio representation. Output mel frames are converted back to audio using a wavenet vocoder.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

1907.07769

Country: North America > United States (0.46)

Genre: Research Report (0.52)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.89)

Add feedback

Ten Machine Learning Algorithms You Should Know to Become a Data Scientist

#artificialintelligenceJul-14-2019, 22:33:30 GMT

Let's say I am given an Excel sheet with data about various fruits and I have to tell which look like Apples. What I will do is ask a question "Which fruits are red and round?" and divide all fruits which answer yes and no to the question. Now, All Red and Round fruits might not be apples and all apples won't be red and round. So I will ask a question "Which fruits have red or yellow colour hints on them? " on red and round fruits and will ask "Which fruits are green and round?" on not red and round fruits. Based on these questions I can tell with considerable accuracy which are apples. This cascade of questions is what a decision tree is. However, this is a decision tree based on my intuition.

machine learning, natural language, reinforcement learning, (17 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Qatar (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.73)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)
(2 more...)

Add feedback

Task Selection Policies for Multitask Learning

Glover, John, Hokamp, Chris

arXiv.org Machine LearningJul-14-2019

One of the questions that arises when designing models that learn to solve multiple tasks simultaneously is how much of the available training budget should be devoted to each individual task. We refer to any formalized approach to addressing this problem (learned or otherwise) as a task selection policy. In this work we provide an empirical evaluation of the performance of some common task selection policies in a synthetic bandit-style setting, as well as on the GLUE benchmark for natural language understanding. We connect task selection policy learning to existing work on automated curriculum learning and off-policy evaluation, and suggest a method based on counterfactual estimation that leads to improved model performance in our experimental settings.

arxiv, learning, task selection policy, (14 more...)

arXiv.org Machine Learning

1907.06214

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Plymouth County > Norwell (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.51)
(2 more...)

Add feedback

Self-Regulated Interactive Sequence-to-Sequence Learning

Kreutzer, Julia, Riezler, Stefan

arXiv.org Machine LearningJul-11-2019

Not all types of supervision signals are created equal: Different types of feedback have different costs and effects on learning. We show how self-regulation strategies that decide when to ask for which kind of feedback from a teacher (or from oneself) can be cast as a learning-to-learn problem leading to improved cost-aware sequence-to-sequence learning. In experiments on interactive neural machine translation, we find that the self-regulator discovers an $\epsilon$-greedy strategy for the optimal cost-quality trade-off by mixing different feedback types including corrections, error markups, and self-supervision. Furthermore, we demonstrate its robustness under domain shift and identify it as a promising alternative to active learning.

machine learning, natural language, translation, (18 more...)

arXiv.org Machine Learning

1907.0519

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(23 more...)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Multiple Generative Models Ensemble for Knowledge-Driven Proactive Human-Computer Dialogue Agent

Dai, Zelin, Liu, Weitang, Zhang, Hao, Zhu, Minghao, Wang, Long

arXiv.org Artificial IntelligenceJul-8-2019

Multiple sequence to sequence models were used to establish an end-to-end multi-turns proactive dialogue generation agent, with the aid of data augmentation techniques and variant encoder-decoder structure designs. A rank-based ensemble approach was developed for boosting performance. Results indicate that our single model, in average, makes an obvious improvement in the terms of F1-score and BLEU over the baseline by 18.67% on the DuConv dataset. In particular, the ensemble methods further significantly outperform the baseline by 35.85%.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1907.0359

Country:

North America > Haiti (0.15)
Europe > Switzerland (0.14)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment (0.70)
Media > Film (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.39)

Add feedback

Learning Neural Sequence-to-Sequence Models from Weak Feedback with Bipolar Ramp Loss

Jehl, Laura, Lawrence, Carolin, Riezler, Stefan

arXiv.org Machine LearningJul-6-2019

In many machine learning scenarios, supervision by gold labels is not available and consequently neural models cannot be trained directly by maximum likelihood estimation (MLE). In a weak supervision scenario, metric-augmented objectives can be employed to assign feedback to model outputs, which can be used to extract a supervision signal for training. We present several objectives for two separate weakly supervised tasks, machine translation and semantic parsing. We show that objectives should actively discourage negative outputs in addition to promoting a surrogate gold structure. This notion of bipolarity is naturally present in ramp loss objectives, which we adapt to neural models. We show that bipolar ramp loss objectives outperform other non-bipolar ramp loss objectives and minimum risk training (MRT) on both weakly supervised tasks, as well as on a supervised machine translation task. Additionally, we introduce a novel token-level ramp loss objective, which is able to outperform even the best sequence-level ramp loss on both weakly supervised tasks.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

1907.03748

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
(20 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.89)
(3 more...)

Add feedback