Goto

Collaborating Authors

 Machine Translation


Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation

arXiv.org Artificial Intelligence

A neural multimodal machine translation (MMT) system is one that aims to perform better translation by extending conventional text-only translation models with multimodal information. Many recent studies report improvements when equipping their models with the multimodal module, despite the controversy of whether such improvements indeed come from the multimodal part. We revisit the contribution of multimodal information in MMT by devising two interpretable MMT models. To our surprise, although our models replicate similar gains as recently developed multimodal-integrated systems achieved, our models learn to ignore the multimodal information. Upon further investigation, we discover that the improvements achieved by the multimodal models over text-only counterparts are in fact results of the regularization effect. We report empirical findings that highlight the importance of MMT models' interpretability, and discuss how our findings will benefit future research.


The Future of Computational Linguistics: On Beyond Alchemy

#artificialintelligence

Over the decades, fashions in Computational Linguistics have changed again and again, with major shifts in motivations, methods and applications. When digital computers first appeared, linguistic analysis adopted the new methods of information theory, which accorded well with the ideas that dominated psychology and philosophy. Then came formal language theory and the idea of AI as applied logic, in sync with the development of cognitive science. That was followed by a revival of 1950s-style empiricismโ€”AI as applied statisticsโ€”which in turn was followed by the age of deep nets. There are signs that the climate is changing again, and we offer some thoughts about paths forward, especially for younger researchers who will soon be the leaders.


Data Augmentation for Text Generation Without Any Augmented Data

arXiv.org Artificial Intelligence

Data augmentation is an effective way to improve the performance of many neural text generation models. However, current data augmentation methods need to define or choose proper data mapping functions that map the original samples into the augmented samples. In this work, we derive an objective to formulate the problem of data augmentation on text generation tasks without any use of augmented data constructed by specific mapping functions. Our proposed objective can be efficiently optimized and applied to popular loss functions on text generation tasks with a convergence rate guarantee. Experiments on five datasets of two text generation tasks show that our approach can approximate or even surpass popular data augmentation methods.


Selective Knowledge Distillation for Neural Machine Translation

arXiv.org Artificial Intelligence

Neural Machine Translation (NMT) models achieve state-of-the-art performance on many translation benchmarks. As an active research field in NMT, knowledge distillation is widely applied to enhance the model's performance by transferring teacher model's knowledge on each training sample. However, previous work rarely discusses the different impacts and connections among these samples, which serve as the medium for transferring teacher knowledge. In this paper, we design a novel protocol that can effectively analyze the different impacts of samples by comparing various samples' partitions. Based on above protocol, we conduct extensive experiments and find that the teacher's knowledge is not the more, the better. Knowledge over specific samples may even hurt the whole performance of knowledge distillation. Finally, to address these issues, we propose two simple yet effective strategies, i.e., batch-level and global-level selections, to pick suitable samples for distillation. We evaluate our approaches on two large-scale machine translation tasks, WMT'14 English->German and WMT'19 Chinese->English. Experimental results show that our approaches yield up to +1.28 and +0.89 BLEU points improvements over the Transformer baseline, respectively.


Project CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks

arXiv.org Artificial Intelligence

Advancements in deep learning and machine learning algorithms have enabled breakthrough progress in computer vision, speech recognition, natural language processing and beyond. In addition, over the last several decades, software has been built into the fabric of every aspect of our society. Together, these two trends have generated new interest in the fast-emerging research area of AI for Code. As software development becomes ubiquitous across all industries and code infrastructure of enterprise legacy applications ages, it is more critical than ever to increase software development productivity and modernize legacy applications. Over the last decade, datasets like ImageNet, with its large scale and diversity, have played a pivotal role in algorithmic advancements from computer vision to language and speech understanding. In this paper, we present Project CodeNet, a first-of-its-kind, very large scale, diverse, and high-quality dataset to accelerate the algorithmic advancements in AI for Code. It consists of 14M code samples and about 500M lines of code in 55 different programming languages. Project CodeNet is not only unique in its scale, but also in the diversity of coding tasks it can help benchmark: from code similarity and classification for advances in code recommendation algorithms, and code translation between a large variety programming languages, to advances in code performance (both runtime, and memory) improvement techniques. CodeNet also provides sample input and output test sets for over 7M code samples, which can be critical for determining code equivalence in different languages. As a usability feature, we provide several preprocessing tools in Project CodeNet to transform source codes into representations that can be readily used as inputs into machine learning models.


How AI Is Accelerating Business Growth and Innovation

#artificialintelligence

Despite the many ominous connotations trumpeted in works of fiction, the adoption and growth of AI can is simply another phase of the technological advance that has marked the development of human society. Yet, because we associate intelligence with living creatures, particularly our own species, the idea of machines that possess that faculty excites some trepidation. AI agents may turn out to be as unpredictable and perverse as any intelligent human. No such worry is evident in Silicon Valley. Sundar Pichai, Google's chief, speaking at the World Economic Forum in Davos, Switzerland, enthused about the technology: "AI is probably the most important thing humanity has ever worked on. I think of it as something more profound than electricity or fire," he said. Google is a major participant in an AI market that is clipping along at a five-year compound annual growth rate (CAGR) of 17.5%. Globally, the industry is projected to swell to $554.3 billion by 2024. Other players of note are IBM, Intuit, Microsoft, OpenText, Palantir, SAS, and Slack.


Do Context-Aware Translation Models Pay the Right Attention?

arXiv.org Artificial Intelligence

Context-aware machine translation models are designed to leverage contextual information, but often fail to do so. As a result, they inaccurately disambiguate pronouns and polysemous words that require context for resolution. In this paper, we ask several questions: What contexts do human translators use to resolve ambiguous words? Are models paying large amounts of attention to the same context? What if we explicitly train them to do so? To answer these questions, we introduce SCAT (Supporting Context for Ambiguous Translations), a new English-French dataset comprising supporting context words for 14K translations that professional translators found useful for pronoun disambiguation. Using SCAT, we perform an in-depth analysis of the context used to disambiguate, examining positional and lexical characteristics of the supporting words. Furthermore, we measure the degree of alignment between the model's attention scores and the supporting context from SCAT, and apply a guided attention strategy to encourage agreement between the two.


Neural Machine Translation using a Seq2Seq Architecture and Attention (ENG to POR)

#artificialintelligence

Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation [1]. Its strength comes from the fact that it learns the mapping directly from input text to associated output text. It has been proven to be more effective than traditional phrase-based machine translation, which requires much more effort to design the model. On the other hand, NMT models are costly to train, especially on large-scale translation datasets. They are also significantly slower at inference time due to the large number of parameters used.


Can Artificial Intelligence Create its Own Language?

#artificialintelligence

Back in 2017, the media was frenzying over Facebook's decision to scrap one of its artificial intelligence engines, which was said to have created its own language that could not be understood by humans. AI can sometimes be quirky and suspicious. In the current world technology is not a luxury, but a necessity. AI is one of the most popular of them, which has successfully aided several innovations across all industries. There have been many speculations around this disruptive technology and it has often been looked upon with fear.


Fujitsu releases hands-free speech translation service

The Japan Times

Fujitsu Ltd. on Thursday released a multilingual speech translation service that does not require users to operate devices by hand. The service is designed for settings in which multilingual communication is needed amid a rise in the domestic population of non-Japanese speakers, such as medical facilities. It automatically translates speech after identifying the voices and locations of users on the basis of sound picked up by directional microphones connected to tablet devices. Fujitsu said that the voice recognition is highly accurate thanks to technology limiting the effects of background noise. In addition to medical settings, the service is expected to be used at tourist sites.