Goto

Collaborating Authors

 Machine Translation


Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining

arXiv.org Artificial Intelligence

Many real-world applications involve the use of Optical Character Recognition (OCR) engines to transform handwritten images into transcripts on which downstream Natural Language Processing (NLP) models are applied. In this process, OCR engines may introduce errors and inputs to downstream NLP models become noisy. Despite that pre-trained models achieve state-of-the-art performance in many NLP benchmarks, we prove that they are not robust to noisy texts generated by real OCR engines. This greatly limits the application of NLP models in real-world scenarios. In order to improve model performance on noisy OCR transcripts, it is natural to train the NLP model on labelled noisy texts. However, in most cases there are only labelled clean texts. Since there is no handwritten pictures corresponding to the text, it is impossible to directly use the recognition model to obtain noisy labelled data. Human resources can be employed to copy texts and take pictures, but it is extremely expensive considering the size of data for model training. Consequently, we are interested in making NLP models intrinsically robust to OCR errors in a low resource manner. We propose a novel robust training framework which 1) employs simple but effective methods to directly simulate natural OCR noises from clean texts and 2) iteratively mines the hard examples from a large number of simulated samples for optimal performance. 3) To make our model learn noise-invariant representations, a stability loss is employed. Experiments on three real-world datasets show that the proposed framework boosts the robustness of pre-trained models by a large margin. We believe that this work can greatly promote the application of NLP models in actual scenarios, although the algorithm we use is simple and straightforward. We make our codes and three datasets publicly available\footnote{https://github.com/tal-ai/Robust-learning-MSSHEM}.


A Survey on Data Augmentation for Text Classification

arXiv.org Artificial Intelligence

Data augmentation, the artificial creation of training data for machine learning by transformations, is a widely studied research field across machine learning disciplines. While it is useful for increasing the generalization capabilities of a model, it can also address many other challenges and problems, from overcoming a limited amount of training data over regularizing the objective to limiting the amount data used to protect privacy. Based on a precise description of the goals and applications of data augmentation (C1) and a taxonomy for existing works (C2), this survey is concerned with data augmentation methods for textual classification and aims to achieve a concise and comprehensive overview for researchers and practitioners (C3). Derived from the taxonomy, we divided more than 100 methods into 12 different groupings and provide state-of-the-art references expounding which methods are highly promising (C4). Finally, research perspectives that may constitute a building block for future work are given (C5).


Landscape Analysis: Neural Machine Translation

#artificialintelligence

The Big 3, when it comes to neural machine translation (NMT), are Google, Microsoft, and Amazon. Among this group, Google is the most dominant in terms of supporting 109 languages compared to Microsoft's 73, and Amazon's 55. Overall, Google is flush with talent, data, and resources, and they leverage those assets to maintain their dominant position. With that said, Google Translate is a tool that businesses like Native can license in order to leverage best-in-class technology. In this sense, Google is currently a key partner and will only become a competitor when Native builds out its own neural translation engine.


Improving Low-resource Reading Comprehension via Cross-lingual Transposition Rethinking

arXiv.org Artificial Intelligence

Extractive Reading Comprehension (ERC) has made tremendous advances enabled by the availability of large-scale high-quality ERC training data. Despite of such rapid progress and widespread application, the datasets in languages other than high-resource languages such as English remain scarce. To address this issue, we propose a Cross-Lingual Transposition ReThinking (XLTT) model by modelling existing high-quality extractive reading comprehension datasets in a multilingual environment. To be specific, we present multilingual adaptive attention (MAA) to combine intra-attention and inter-attention to learn more general generalizable semantic and lexical knowledge from each pair of language families. Furthermore, to make full use of existing datasets, we adopt a new training framework to train our model by calculating task-level similarities between each existing dataset and target dataset. The experimental results show that our XLTT model surpasses six baselines on two multilingual ERC benchmarks, especially more effective for low-resource languages with 3.9 and 4.1 average improvement in F1 and EM, respectively.


Zoom acquires an AI company building real-time translation

#artificialintelligence

Zoom has announced that it's acquiring a company known as Kites (short for Karlsruhe Information Technology Solutions), which has worked on creating real-time translation and transcription software. Zoom says the acquisition is a move to help it make communicating with people who speak different languages easier, and that it's looking to add translation capabilities to its video conferencing app. According to its site, Kites began at the Karlsruhe Institute of Technology, and its technology was originally developed to act as in-classroom translation for students who needed help understanding the English or German their professors were lecturing in. Zoom already has real-time transcriptions, but it's limited to people who are talking in English. On a support page, Zoom also makes it clear that its current live transcription feature may not meet certain accuracy requirements.


World University Medical School - World University and School Wiki

#artificialintelligence

I hope the budding, online, free World University Medical School - http://worlduniversity.wikia.com/wiki/World_University_Medical_School Dr. Judy Palfrey is moving to Washington DC from the Boston area to help further Universal Health Care in the Obama administration, I think. WUaS is planning for a "Admitted Students' Day" for the first, matriculating Bachelor's degree class, on or around Saturday, April 14th, 2014, and the second Saturday of April for other degrees in the future. Prevent and Reverse Heart Disease: The Revolutionary, Scientifically Proven, Nutrition-Based Cure. Dr. Dean Ornish's Program for Reversing Heart Disease: The Only System Scientifically Proven to Reverse Heart Disease Without Drugs or Surgery.


Question Answering over Knowledge Graphs with Neural Machine Translation and Entity Linking

arXiv.org Artificial Intelligence

The goal of Question Answering over Knowledge Graphs (KGQA) is to find answers for natural language questions over a knowledge graph. Recent KGQA approaches adopt a neural machine translation (NMT) approach, where the natural language question is translated into a structured query language. However, NMT suffers from the out-of-vocabulary problem, where terms in a question may not have been seen during training, impeding their translation. This issue is particularly problematic for the millions of entities that large knowledge graphs describe. We rather propose a KGQA approach that delegates the processing of entities to entity linking (EL) systems. NMT is then used to create a query template with placeholders that are filled by entities identified in an EL phase. Slot filling is used to decide which entity fills which placeholder. Experiments for QA over Wikidata show that our approach outperforms pure NMT: while there remains a strong dependence on having seen similar query templates during training, errors relating to entities are greatly reduced.


Top 10 Google Products Empowered by Artificial Intelligence

#artificialintelligence

For the past few years, Google has been dominating the field of artificial intelligence. Google's search engine has revolutionized the internet. From large-scale organizations to kids, Google's search engine has provided every one of us with easier access to information. The company claims that its advancements in technology and enhanced customer service would not have been possible had it not invested in disruptive technologies like artificial intelligence, machine learning, deep learning, and others. This article provides a list of the top 10 products manufactured by Google which are powered by artificial intelligence.


IITP at WAT 2021: System description for English-Hindi Multimodal Translation Task

arXiv.org Artificial Intelligence

Neural Machine Translation (NMT) is a predominant machine translation technology nowadays because of its end-to-end trainable flexibility. However, NMT still struggles to translate properly in low-resource settings specifically on distant language pairs. One way to overcome this is to use the information from other modalities if available. The idea is that despite differences in languages, both the source and target language speakers see the same thing and the visual representation of both the source and target is the same, which can positively assist the system. Multimodal information can help the NMT system to improve the translation by removing ambiguity on some phrases or words. We participate in the 8th Workshop on Asian Translation (WAT - 2021) for English-Hindi multimodal translation task and achieve 42.47 and 37.50 BLEU points for Evaluation and Challenge subset, respectively.


Zoom will have automatic translation in real time to videoconferences after buying the company Kites

#artificialintelligence

Video calling platforms and apps have taken on an unprecedented role since the arrival of Covid-19. One of the most important and popular is Zoom, which will now add a new real-time machine translation feature, after announcing the purchase of communications company Kites . Through its official blog, Zoom announced that they are in negotiations to acquire the company Karlsruhe Information Technology Solutions, abbreviated Kites . It is a German startup "dedicated to the development of real-time machine translation solutions" or MT, for its acronym in English. Zoom said that the acquisition of Kites represents the possibility of eliminating the language gaps between its users.