AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Translating PDF documents using Amazon Translate and Amazon Textract

#artificialintelligenceSep-8-2020, 18:29:47 GMT

In 1993, the Portable Document Format or the PDF was born and released to the world. Since then, companies across various industries have been creating, scanning, and storing large volumes of documents in this digital format. These documents and the content within them are vital to supporting your business. Yet in many cases, the content is text-heavy and often written in a different language. This limits the flow of information and can directly influence your organization's business productivity and global expansion strategy.

artificial intelligence, machine learning, natural language, (16 more...)

#artificialintelligence

Industry: Retail > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.67)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.53)

Add feedback

Real world Applications of Natural Language Processing – Sushrut Tendulkar

#artificialintelligenceSep-4-2020, 05:35:34 GMT

Speech recognition is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format. Speech recognition has many applications, such as home automation, mobile telephony, virtual assistance, hands-free computing, video games, and so on. This is the application of Speech recognition where the machine converts text into speech so that it could be easily listened. Ex: Speechify is a startup that focuses on creating Audiobooks from any text. Machine Translation (MT) is the task of automatically converting one natural language into another, preserving the meaning of the input text, and producing fluent text in the output language.

artificial intelligence, natural language, text processing, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.79)

Add feedback

Recent Trends in the Use of Deep Learning Models for Grammar Error Handling

Naghshnejad, Mina, Joshi, Tarun, Nair, Vijayan N.

arXiv.org Artificial IntelligenceSep-4-2020

Grammar error handling (GEH) is an important topic in natural language processing (NLP). GEH includes both grammar error detection and grammar error correction. Recent advances in computation systems have promoted the use of deep learning (DL) models for NLP problems such as GEH. In this survey we focus on two main DL approaches for GEH: neural machine translation models and editor models. We describe the three main stages of the pipeline for these models: data preparation, training, and inference. Additionally, we discuss different techniques to improve the performance of these models at each stage of the pipeline. We compare the performance of different models and conclude with proposed future directions.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2009.02358

Country:

Asia > Singapore (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Czechia > Prague (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)

Add feedback

Learning to summarize from human feedback

Stiennon, Nisan, Ouyang, Long, Wu, Jeff, Ziegler, Daniel M., Lowe, Ryan, Voss, Chelsea, Radford, Alec, Amodei, Dario, Christiano, Paul

arXiv.org Artificial IntelligenceSep-2-2020

As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task. For example, summarization models are often trained to predict human reference summaries and evaluated using ROUGE, but both of these metrics are rough proxies for what we really care about---summary quality. In this work, we show that it is possible to significantly improve summary quality by training a model to optimize for human preferences. We collect a large, high-quality dataset of human comparisons between summaries, train a model to predict the human-preferred summary, and use that model as a reward function to fine-tune a summarization policy using reinforcement learning. We apply our method to a version of the TL;DR dataset of Reddit posts and find that our models significantly outperform both human reference summaries and much larger models fine-tuned with supervised learning alone. Our models also transfer to CNN/DM news articles, producing summaries nearly as good as the human reference without any news-specific fine-tuning. We conduct extensive analyses to understand our human feedback dataset and fine-tuned models. We establish that our reward model generalizes to new datasets, and that optimizing our reward model results in better summaries than optimizing ROUGE according to humans. We hope the evidence from our paper motivates machine learning researchers to pay closer attention to how their training loss affects the model behavior they actually want.

labeler, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2009.01325

Country:

Asia > Middle East > Israel (0.04)
North America > United States > California > Los Angeles County > Los Angeles (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
(2 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Leisure & Entertainment (0.93)
Transportation (0.68)
Education > Educational Setting (0.46)
Media > News (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)

Add feedback

How artificial intelligence and robotics are changing chemical research

#artificialintelligenceSep-1-2020, 11:30:09 GMT

An end-to-end, integrated chemical research system unveiled by IBM last week gives us a glimpse of how artificial intelligence, robotics and the cloud might change the future of drug discovery. And it's a good time as any to see some a breakthrough in the field. The world is still struggling with the covid-19 pandemic, and the race to the find a vaccine for the dangerous novel coronavirus has not yet yielded reliable results. Researchers are bound by travel and social distancing limitations imposed by the virus, and for the most part, they still rely on manual methods that can take many years. While in some cases, such delays can result in inconvenience, in the case of covid-19, it means more lives lost.

machine learning, natural language, roborxn, (20 more...)

#artificialintelligence

Country: Europe > Switzerland > Zürich > Zürich (0.05)

Genre: Research Report > Promising Solution (0.34)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)

Add feedback

Langevin Cooling for Domain Translation

Srinivasan, Vignesh, Müller, Klaus-Robert, Samek, Wojciech, Nakajima, Shinichi

arXiv.org Machine LearningAug-31-2020

Domain translation is the task of finding correspondence between two domains. Several Deep Neural Network (DNN) models, e.g., CycleGAN and cross-lingual language models, have shown remarkable successes on this task under the unsupervised setting---the mappings between the domains are learned from two independent sets of training data in both domains (without paired samples). However, those methods typically do not perform well on a significant proportion of test samples. In this paper, we hypothesize that many of such unsuccessful samples lie at the fringe---relatively low-density areas---of data distribution, where the DNN was not trained very well, and propose to perform Langevin dynamics to bring such fringe samples towards high density areas. We demonstrate qualitatively and quantitatively that our strategy, called Langevin Cooling (L-Cool), enhances state-of-the-art methods in image translation and language translation tasks.

deep learning, l-cool, neural network, (20 more...)

arXiv.org Machine Learning

2008.13723

Country:

Europe > Germany (0.28)
Asia > South Korea (0.14)
Asia > Japan (0.14)

Genre: Research Report (1.00)

Industry: Energy > Oil & Gas (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Ranking Policy Decisions

Pouget, Hadrien, Chockler, Hana, Sun, Youcheng, Kroening, Daniel

arXiv.org Machine LearningAug-31-2020

Policies trained via Reinforcement Learning (RL) are often needlessly complex, making them more difficult to analyse and interpret. In a run with $n$ time steps, a policy will decide $n$ times on an action to take, even when only a tiny subset of these decisions deliver value over selecting a simple default action. Given a pre-trained policy, we propose a black-box method based on statistical fault localisation that ranks the states of the environment according to the importance of decisions made in those states. We evaluate our ranking method by creating new, simpler policies by pruning decisions identified as unimportant, and measure the impact on performance. Our experimental results on a diverse set of standard benchmarks (gridworld, CartPole, Atari games) show that in some cases less than half of the decisions made contribute to the expected reward. We furthermore show that the decisions made in the most frequently visited states are not the most important for the expected reward.

machine learning, natural language, reinforcement learning, (19 more...)

arXiv.org Machine Learning

2008.13607

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Netherlands > South Holland > Delft (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment > Games > Computer Games (0.57)
Transportation > Air (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)

Add feedback

Facebook Ramps Up Open Source Drive Into Speech Translation

#artificialintelligenceAug-28-2020, 23:59:17 GMT

Facebook continues to pour considerable resources into machine translation (MT); but, as evidenced by a recent Thai translation snafu, language technology remains a major challenge for the social media giant. In addition to improving quality estimation and various other initiatives, Facebook is currently working on two others that share information with the broader open source community, allowing developers to improve the technology. In a July 2020 blog post, Facebook AI made available CoVoST V2, a "massively multilingual" speech-to-text translation dataset. The original CoVoST was built on Mozilla's Common Voice, a database of crowdsourced voice recordings. This new version boasts 2,900 hours of speech, as well as speech translation data from 21 languages into English and from English into 15 languages.

artificial intelligence, natural language, social media, (10 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Council Post: Do We Need More Data Or More Science In Data Science?

#artificialintelligenceAug-28-2020, 12:30:23 GMT

Is the success of Google that of the algorithms or that of data? Today's fascination with artificial intelligence (AI) reflects both our appetite for data and our excitement about the new opportunities in machine learning. Here, I argue that newcomers to the field of data science are blinded by the shiny object of magical algorithms -- and that they forget the critical infrastructures that are needed to create and to manage data in the first place. There are now many companies that provide AI services. An attractive offer should affirm all of the above -- the sole expertise in analyses and algorithms is generally insufficient, as it does not necessarily address the data part of the equation.

artificial intelligence, machine learning, natural language, (13 more...)

#artificialintelligence

Industry: Health & Medicine (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

How to use Google Translate using Python – Predictive Hacks

#artificialintelligenceAug-24-2020, 10:55:15 GMT

All of us are aware of Google Translate. Today, we will provide examples of how we can use the googletrans which is a free and unlimited python library that implemented Google Translate API. This uses the Google Translate Ajax API to make calls to such methods as detect and translate . The first thing that we need to do is to install the googletrans library. I suggest to use the conda install command.

artificial intelligence, google translate, natural language, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback