Machine Translation
Translating PDF documents using Amazon Translate and Amazon Textract
In 1993, the Portable Document Format or the PDF was born and released to the world. Since then, companies across various industries have been creating, scanning, and storing large volumes of documents in this digital format. These documents and the content within them are vital to supporting your business. Yet in many cases, the content is text-heavy and often written in a different language. This limits the flow of information and can directly influence your organization's business productivity and global expansion strategy.
Real world Applications of Natural Language Processing – Sushrut Tendulkar
Speech recognition is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format. Speech recognition has many applications, such as home automation, mobile telephony, virtual assistance, hands-free computing, video games, and so on. This is the application of Speech recognition where the machine converts text into speech so that it could be easily listened. Ex: Speechify is a startup that focuses on creating Audiobooks from any text. Machine Translation (MT) is the task of automatically converting one natural language into another, preserving the meaning of the input text, and producing fluent text in the output language.
Recent Trends in the Use of Deep Learning Models for Grammar Error Handling
Naghshnejad, Mina, Joshi, Tarun, Nair, Vijayan N.
Grammar error handling (GEH) is an important topic in natural language processing (NLP). GEH includes both grammar error detection and grammar error correction. Recent advances in computation systems have promoted the use of deep learning (DL) models for NLP problems such as GEH. In this survey we focus on two main DL approaches for GEH: neural machine translation models and editor models. We describe the three main stages of the pipeline for these models: data preparation, training, and inference. Additionally, we discuss different techniques to improve the performance of these models at each stage of the pipeline. We compare the performance of different models and conclude with proposed future directions.
Learning to summarize from human feedback
Stiennon, Nisan, Ouyang, Long, Wu, Jeff, Ziegler, Daniel M., Lowe, Ryan, Voss, Chelsea, Radford, Alec, Amodei, Dario, Christiano, Paul
As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task. For example, summarization models are often trained to predict human reference summaries and evaluated using ROUGE, but both of these metrics are rough proxies for what we really care about---summary quality. In this work, we show that it is possible to significantly improve summary quality by training a model to optimize for human preferences. We collect a large, high-quality dataset of human comparisons between summaries, train a model to predict the human-preferred summary, and use that model as a reward function to fine-tune a summarization policy using reinforcement learning. We apply our method to a version of the TL;DR dataset of Reddit posts and find that our models significantly outperform both human reference summaries and much larger models fine-tuned with supervised learning alone. Our models also transfer to CNN/DM news articles, producing summaries nearly as good as the human reference without any news-specific fine-tuning. We conduct extensive analyses to understand our human feedback dataset and fine-tuned models. We establish that our reward model generalizes to new datasets, and that optimizing our reward model results in better summaries than optimizing ROUGE according to humans. We hope the evidence from our paper motivates machine learning researchers to pay closer attention to how their training loss affects the model behavior they actually want.
How artificial intelligence and robotics are changing chemical research
An end-to-end, integrated chemical research system unveiled by IBM last week gives us a glimpse of how artificial intelligence, robotics and the cloud might change the future of drug discovery. And it's a good time as any to see some a breakthrough in the field. The world is still struggling with the covid-19 pandemic, and the race to the find a vaccine for the dangerous novel coronavirus has not yet yielded reliable results. Researchers are bound by travel and social distancing limitations imposed by the virus, and for the most part, they still rely on manual methods that can take many years. While in some cases, such delays can result in inconvenience, in the case of covid-19, it means more lives lost.
Langevin Cooling for Domain Translation
Srinivasan, Vignesh, Müller, Klaus-Robert, Samek, Wojciech, Nakajima, Shinichi
Domain translation is the task of finding correspondence between two domains. Several Deep Neural Network (DNN) models, e.g., CycleGAN and cross-lingual language models, have shown remarkable successes on this task under the unsupervised setting---the mappings between the domains are learned from two independent sets of training data in both domains (without paired samples). However, those methods typically do not perform well on a significant proportion of test samples. In this paper, we hypothesize that many of such unsuccessful samples lie at the fringe---relatively low-density areas---of data distribution, where the DNN was not trained very well, and propose to perform Langevin dynamics to bring such fringe samples towards high density areas. We demonstrate qualitatively and quantitatively that our strategy, called Langevin Cooling (L-Cool), enhances state-of-the-art methods in image translation and language translation tasks.
Ranking Policy Decisions
Pouget, Hadrien, Chockler, Hana, Sun, Youcheng, Kroening, Daniel
Policies trained via Reinforcement Learning (RL) are often needlessly complex, making them more difficult to analyse and interpret. In a run with $n$ time steps, a policy will decide $n$ times on an action to take, even when only a tiny subset of these decisions deliver value over selecting a simple default action. Given a pre-trained policy, we propose a black-box method based on statistical fault localisation that ranks the states of the environment according to the importance of decisions made in those states. We evaluate our ranking method by creating new, simpler policies by pruning decisions identified as unimportant, and measure the impact on performance. Our experimental results on a diverse set of standard benchmarks (gridworld, CartPole, Atari games) show that in some cases less than half of the decisions made contribute to the expected reward. We furthermore show that the decisions made in the most frequently visited states are not the most important for the expected reward.
Facebook Ramps Up Open Source Drive Into Speech Translation
Facebook continues to pour considerable resources into machine translation (MT); but, as evidenced by a recent Thai translation snafu, language technology remains a major challenge for the social media giant. In addition to improving quality estimation and various other initiatives, Facebook is currently working on two others that share information with the broader open source community, allowing developers to improve the technology. In a July 2020 blog post, Facebook AI made available CoVoST V2, a "massively multilingual" speech-to-text translation dataset. The original CoVoST was built on Mozilla's Common Voice, a database of crowdsourced voice recordings. This new version boasts 2,900 hours of speech, as well as speech translation data from 21 languages into English and from English into 15 languages.
Council Post: Do We Need More Data Or More Science In Data Science?
Is the success of Google that of the algorithms or that of data? Today's fascination with artificial intelligence (AI) reflects both our appetite for data and our excitement about the new opportunities in machine learning. Here, I argue that newcomers to the field of data science are blinded by the shiny object of magical algorithms -- and that they forget the critical infrastructures that are needed to create and to manage data in the first place. There are now many companies that provide AI services. An attractive offer should affirm all of the above -- the sole expertise in analyses and algorithms is generally insufficient, as it does not necessarily address the data part of the equation.
How to use Google Translate using Python – Predictive Hacks
All of us are aware of Google Translate. Today, we will provide examples of how we can use the googletrans which is a free and unlimited python library that implemented Google Translate API. This uses the Google Translate Ajax API to make calls to such methods as detect and translate . The first thing that we need to do is to install the googletrans library. I suggest to use the conda install command.