Machine Translation
Google's new AI can help you speak another language in your own voice
Google Translate is one of the company's most used products. It helps people translate one language to another through typing, taking pics of text, and using speech-to-text technology. Now, the company's launching a new project called Translatotron, which will offer direct speech-to-speech translations โ without even using any text. In a post on Google's AI blog, the team behind the tool explained that instead of using speech-to-text and then text-to-speech to convert voice, it relied on a new model (which runs on a neural network) to develop the new system. Get 50% off tickets if you buy now.
Adaptive Attention Span in Transformers
Sukhbaatar, Sainbayar, Grave, Edouard, Bojanowski, Piotr, Joulin, Armand
Part of its success is due to its ability to model called Sequential Transformer capture long term dependencies. This is achieved (Vaswani et al., 2017). A Transformer is by taking long sequences as inputs and explicitly made of a sequence of layers that are composed of compute the relations between every token via a a block of parallel self-attention layers followed mechanism called the "self-attention" layer (Al-by a feedforward network. We refer to Vaswani Rfou et al., 2019).
A Case Study: Exploiting Neural Machine Translation to Translate CUDA to OpenCL
The sequence-to-sequence (seq2seq) model for neural machine translation has significantly improved the accuracy of language translation. There have been new efforts to use this seq2seq model for program language translation or program comparisons. In this work, we present the detailed steps of using a seq2seq model to translate CUDA programs to OpenCL programs, which both have very similar programming styles. Our work shows (i) a training input set generation method, (ii) pre/post processing, and (iii) a case study using Polybench-gpu-1.0, NVIDIA SDK, and Rodinia benchmarks.
An Emotion Detection System for Cantonese
Lee, John (City University of Hong Kong)
We present the first automatic emotion detection system for Cantonese. This system classifies input text into eight emotion classes: expectancy, joy, love, surprise, anxiety, sorrow, angry, or hate. While a number of emotion corpora and lexica for Mandarin Chinese have been developed, no emotion dataset is available for Cantonese. We leverage existing Mandarin Chinese emotion resources to build the system, with support from Cantonese-Mandarin lexical mappings from a machine translation system, as well as English-Mandarin lexical mappings to handle code-switching in Cantonese input. Evaluation on a set of Cantonese sentences from social media shows promising results.
Synchronous Bidirectional Neural Machine Translation
Zhou, Long, Zhang, Jiajun, Zong, Chengqing
Existing approaches to neural machine translation (NMT) generate the target language sequence token by token from left to right. However, this kind of unidirectional decoding framework cannot make full use of the target-side future contexts which can be produced in a right-to-left decoding direction, and thus suffers from the issue of unbalanced outputs. In this paper, we introduce a synchronous bidirectional neural machine translation (SB-NMT) that predicts its outputs using left-to-right and right-to-left decoding simultaneously and interactively, in order to leverage both of the history and future information at the same time. Specifically, we first propose a new algorithm that enables synchronous bidirectional decoding in a single model. Then, we present an interactive decoding model in which left-to-right (right-to-left) generation does not only depend on its previously generated outputs, but also relies on future contexts predicted by right-to-left (left-to-right) decoding. We extensively evaluate the proposed SB-NMT model on large-scale NIST Chinese-English, WMT14 English-German, and WMT18 Russian-English translation tasks. Experimental results demonstrate that our model achieves significant improvements over the strong Transformer model by 3.92, 1.49 and 1.04 BLEU points respectively, and obtains the state-of-the-art performance on Chinese-English and English-German translation tasks.
Assuring the Machine Learning Lifecycle: Desiderata, Methods, and Challenges
Ashmore, Rob, Calinescu, Radu, Paterson, Colin
Machine learning has evolved into an enabling technology for a wide range of highly successful applications. The potential for this success to continue and accelerate has placed machine learning (ML) at the top of research, economic and political agendas. Such unprecedented interest is fuelled by a vision of ML applicability extending to healthcare, transportation, defence and other domains of great societal importance. Achieving this vision requires the use of ML in safety-critical applications that demand levels of assurance beyond those needed for current ML applications. Our paper provides a comprehensive survey of the state-of-the-art in the assurance of ML, i.e. in the generation of evidence that ML is sufficiently safe for its intended use. The survey covers the methods capable of providing such evidence at different stages of the machine learning lifecycle, i.e. of the complex, iterative process that starts with the collection of the data used to train an ML component for a system, and ends with the deployment of that component within the system. The paper begins with a systematic presentation of the ML lifecycle and its stages. We then define assurance desiderata for each stage, review existing methods that contribute to achieving these desiderata, and identify open challenges that require further research.
Survey on Evaluation Methods for Dialogue Systems
Deriu, Jan, Rodrigo, Alvaro, Otegi, Arantxa, Echegoyen, Guillermo, Rosset, Sophie, Agirre, Eneko, Cieliebak, Mark
In this paper we survey the methods and concepts developed for the evaluation of dialogue systems. Evaluation is a crucial part during the development process. Often, dialogue systems are evaluated by means of human evaluations and questionnaires. However, this tends to be very cost and time intensive. Thus, much work has been put into finding methods, which allow to reduce the involvement of human labour. In this survey, we present the main concepts and methods. For this, we differentiate between the various classes of dialogue systems (task-oriented dialogue systems, conversational dialogue systems, and question-answering dialogue systems). We cover each class by introducing the main technologies developed for the dialogue systems and then by presenting the evaluation methods regarding this class.
Artificial Intelligence in Gaming: The Responsible Way ShowsHappening
No refund on tickets will be made under any circumstances unless otherwise instructed by the event organiser Venue Owner reserves the right without refund or compensation to refuse admission to any persons whose conduct is disorderly or unbecoming. The Promoter may add, withdraw or substitute artistes and/or vary advertised programmes, event times, seating arrangements and audience capacity without prior notice. The Promoter/Venue Owner may postpone, cancel,interrupt or stop the event due to adverse weather, dangerous situations, or any other causes beyond his reasonable control. Information is collected from those registering with www.showshappening.com in order to facilitate ticket purchase or other services available. This information is collected and used in accordance with ShowsHappening's Privacy Policy, which forms part of these conditions.
Neural Machine Translation Engineer ai-jobs.net
Do you dream of harnessing your engineering and machine learning knowledge to enrich users' lives? To improve their privacy and security? To expand their ecosystems by opening up models and data to the world? If so, you should join Mozilla's Machine Learning group as part of the three year, EU funded Bergamot Project. The goal of Bergamot is to extend Firefox with an open, on-device neural machine translation (NMT) engine, making translation local, private, and secure.
Semantic Drift in Multilingual Representations
Beinborn, Lisa, Choenni, Rochelle
Multilingual representations have mostly been evaluated based on their performance on specific tasks. In this article, we look beyond engineering goals and analyze the relations between languages in computational representations. We introduce a methodology for comparing languages based on their organization of semantic concepts. We propose to conduct an adapted version of representational similarity analysis of a selected set of concepts in computational multilingual representations. Using this analysis method, we can reconstruct a phylogenetic tree that closely resembles those assumed by linguistic experts. These results indicate that multilingual distributional representations which are only trained on monolingual text and bilingual dictionaries preserve relations between languages without the need for any etymological information. In addition, we propose a measure to identify semantic drift between language families. We perform experiments on word-based and sentence-based multilingual models and provide both quantitative results and qualitative examples. Analyses of semantic drift in multilingual representations can serve two purposes: they can indicate unwanted characteristics of the computational models and they provide a quantitative means to study linguistic phenomena across languages. The code is available at https://github.com/beinborn/SemanticDrift.