Machine Translation
Approches quantitatives de l'analyse des pr{\'e}dictions en traduction automatique neuronale (TAN)
Zimina-Poirot, Maria, Ballier, Nicolas, Yunès, Jean-Baptiste
As part of a larger project on optimal learning conditions in neural machine translation, we investigate characteristic training phases of translation engines. All our experiments are carried out using OpenNMT-Py: the pre-processing step is implemented using the Europarl training corpus and the INTERSECT corpus is used for validation. Longitudinal analyses of training phases suggest that the progression of translations is not always linear. Following the results of textometric explorations, we identify the importance of the phenomena related to chronological progression, in order to map different processes at work in neural machine translation (NMT).
Towards Coinductive Models for Natural Language Understanding. Bringing together Deep Learning and Deep Semantics
This article contains a proposal to add coinduction to the computational apparatus of natural language understanding. This, we argue, will provide a basis for more realistic, computationally sound, and scalable models of natural language dialogue, syntax and semantics. Given that the bottom up, inductively constructed, semantic and syntactic structures are brittle, and seemingly incapable of adequately representing the meaning of longer sentences or realistic dialogues, natural language understanding is in need of a new foundation. Coinduction, which uses top down constraints, has been successfully used in the design of operating systems and programming languages. Moreover, implicitly it has been present in text mining, machine translation, and in some attempts to model intensionality and modalities, which provides evidence that it works. This article shows high level formalizations of some of such uses. Since coinduction and induction can coexist, they can provide a common language and a conceptual model for research in natural language understanding. In particular, such an opportunity seems to be emerging in research on compositionality. This article shows several examples of the joint appearance of induction and coinduction in natural language processing. We argue that the known individual limitations of induction and coinduction can be overcome in empirical settings by a combination of the the two methods. We see an open problem in providing a theory of their joint use.
Multimodal Learning for Hateful Memes Detection
Memes are used for spreading ideas through social networks. Although most memes are created for humor, some memes become hateful under the combination of pictures and text. Automatically detecting the hateful memes can help reduce their harmful social impact. Unlike the conventional multimodal tasks, where the visual and textual information is semantically aligned, the challenge of hateful memes detection lies in its unique multimodal information. The image and text in memes are weakly aligned or even irrelevant, which requires the model to understand the content and perform reasoning over multiple modalities. In this paper, we focus on multimodal hateful memes detection and propose a novel method that incorporates the image captioning process into the memes detection process. We conduct extensive experiments on multimodal meme datasets and illustrated the effectiveness of our approach. Our model achieves promising results on the Hateful Memes Detection Challenge.
Reciprocal Supervised Learning Improves Neural Machine Translation
Xu, Minkai, Wang, Mingxuan, Lin, Zhouhan, Zhou, Hao, Zhang, Weinan, Li, Lei
Despite the recent success on image classification, self-training has only achieved limited gains on structured prediction tasks such as neural machine translation (NMT). This is mainly due to the compositionality of the target space, where the far-away prediction hypotheses lead to the notorious reinforced mistake problem. In this paper, we revisit the utilization of multiple diverse models and present a simple yet effective approach named Reciprocal-Supervised Learning (RSL). RSL first exploits individual models to generate pseudo parallel data, and then cooperatively trains each model on the combined synthetic corpus. RSL leverages the fact that different parameterized models have different inductive biases, and better predictions can be made by jointly exploiting the agreement among each other. Unlike the previous knowledge distillation methods built upon a much stronger teacher, RSL is capable of boosting the accuracy of one model by introducing other comparable or even weaker models. RSL can also be viewed as a more efficient alternative to ensemble. Extensive experiments demonstrate the superior performance of RSL on several benchmarks with significant margins.
Customizing your machine translation using Amazon Translate Active Custom Translation
When translating the English phrase "How are you?" to Spanish, would you prefer to use "¿Cómo estás?" or "¿Cómo está usted?" instead? Amazon Translate is a neural machine translation service that delivers fast, high-quality, and affordable language translation. Today, we're excited to introduce Active Custom Translation (ACT), a feature that gives you more control over your machine translation output. You can now influence what machine translation output you would like to get between "¿Cómo estás?" or "¿Cómo está usted?". To make ACT work, simply provide your translation examples in TMX, TSV, or CSV format to create parallel data (PD), and Amazon Translate uses your PD along with your batch translation job to customize the translation output at runtime.
Facilitating the Communication of Politeness through Fine-Grained Paraphrasing
Fu, Liye, Fussell, Susan R., Danescu-Niculescu-Mizil, Cristian
Aided by technology, people are increasingly able to communicate across geographical, cultural, and language barriers. This ability also results in new challenges, as interlocutors need to adapt their communication approaches to increasingly diverse circumstances. In this work, we take the first steps towards automatically assisting people in adjusting their language to a specific communication circumstance. As a case study, we focus on facilitating the accurate transmission of pragmatic intentions and introduce a methodology for suggesting paraphrases that achieve the intended level of politeness under a given communication circumstance. We demonstrate the feasibility of this approach by evaluating our method in two realistic communication scenarios and show that it can reduce the potential for misalignment between the speaker's intentions and the listener's perceptions in both cases.
Gnani.ai launches its new speech recognition technology for Indian defense – TechGraph
"These end-to-end voice translation system uses Automatic Speech Recognition (ASR), Machine Translation and Speech-to-Text to convert Mandarin to English and is designed to help armed forces, intelligence agencies and local law enforcement authorities in improving communication systems and giving substantial leeway to the Indian defense forces," the company in its statement said. The solution has a wide range of applications that includes cross border intelligence, voice surveillance, monitoring telephone/internet conversations, intercepting Radio/Satellite communication, and to bridge interactions during border meetings & joint exercises. Its unique features include noise reduction, dialect/accent detection, and support for all audio file formats. Speaking on the launch, Ananth Nagaraj, Co-founder & CTO, Gnani.ai said, "AI-based Speech Recognition technology is a necessity and is quickly making its way in becoming part of modern warfare. We believe AI has the potential to transform and improve the communication systems and will help strengthen Indian Armed forces." "Understanding linguistic nuances such as phoneme and dialects is a challenge especially when it comes to Mandarin.
Unsupervised Word Translation Pairing using Refinement based Point Set Registration
Oprea, Silviu, Dutta, Sourav, Assem, Haytham
Cross-lingual alignment of word embeddings play an important role in knowledge transfer across languages, for improving machine translation and other multi-lingual applications. Current unsupervised approaches rely on similarities in geometric structure of word embedding spaces across languages, to learn structure-preserving linear transformations using adversarial networks and refinement strategies. However, such techniques, in practice, tend to suffer from instability and convergence issues, requiring tedious fine-tuning for precise parameter setting. This paper proposes BioSpere, a novel framework for unsupervised mapping of bi-lingual word embeddings onto a shared vector space, by combining adversarial initialization and refinement procedure with point set registration algorithm used in image processing. We show that our framework alleviates the shortcomings of existing methodologies, and is relatively invariant to variable adversarial learning performance, depicting robustness in terms of parameter choices and training losses. Experimental evaluation on parallel dictionary induction task demonstrates state-of-the-art results for our framework on diverse language pairs.
More accurate than Google Translate? Meet the Slovenian AI startup offering quality language translations, coming to UK soon - UKTN (UK Tech News)
Speaking to UKTN, Marko Hozjan, co-founder and CEO of TAIA, explains, "TAIA helps businesses translate their content more efficiently by providing professional translators with AI assistance. Files are automatically analysed and a price quote with delivery times is available in under a minute. Users can select between a range of services and delivery times to order a translation service that best fits their needs and budget. Once the project is ordered, it's automatically translated using Neural Machine Translation and prefilled with existing translations from customers' unique Translation memory. This way your projects get translated faster and more consistently with every order. Users can monitor the progress of their project in the convenient web application and easily manage all their translation needs in one place, keeping their data secure and their costs optimised."
AGenT Zero: Zero-shot Automatic Multiple-Choice Question Generation for Skill Assessments
Li, Eric, Su, Jingyi, Sheng, Hao, Wai, Lawrence
Multiple-choice questions (MCQs) offer the most promising avenue for skill evaluation in the era of virtual education and job recruiting, where traditional performance-based alternatives such as projects and essays have become less viable, and grading resources are constrained. The automated generation of MCQs would allow assessment creation at scale. Recent advances in natural language processing have given rise to many complex question generation methods. However, the few methods that produce deployable results in specific domains require a large amount of domain-specific training data that can be very costly to acquire. Our work provides an initial foray into MCQ generation under high data-acquisition cost scenarios by strategically emphasizing paraphrasing the question context (compared to the task). In addition to maintaining semantic similarity between the question-answer pairs, our pipeline, which we call AGenT Zero, consists of only pre-trained models and requires no fine-tuning, minimizing data acquisition costs for question generation. AGenT Zero successfully outperforms other pre-trained methods in fluency and semantic similarity. Additionally, with some small changes, our assessment pipeline can be generalized to a broader question and answer space, including short answer or fill in the blank questions.