AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Why Hasn't AI Mastered Language Translation? - Liwaiwai

#artificialintelligenceOct-25-2019, 23:17:10 GMT

Their creator observed, "And now nothing will be restrained from them, which they have imagined to do." According to the myth, God thwarted this effort by creating diverse languages so that they could no longer collaborate. Language remains a barrier in business and marketing. Even though technological devices can quickly and easily connect, humans from different parts of the world often can't. Translation agencies step in, making presentations, contracts, outsourcing instructions, and advertisements comprehensible to all intended recipients.

ai mastered language translation, translation, university, (12 more...)

#artificialintelligence

Country:

North America > Canada > Quebec (0.05)
Asia > Singapore (0.05)

Industry: Marketing (0.36)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Fast Structured Decoding for Sequence Models

Sun, Zhiqing, Li, Zhuohan, Wang, Haoqing, Lin, Zi, He, Di, Deng, Zhi-Hong

arXiv.org Machine LearningOct-25-2019

Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently, non-autoregressive sequence models were proposed to speed up the inference time. However, these models assume that the decoding process of each token is conditionally independent of others. Such a generation process sometimes makes the output sentence inconsistent, and thus the learned non-autoregressive models could only achieve inferior accuracy compared to their autoregressive counterparts. To improve then decoding consistency and reduce the inference cost at the same time, we propose to incorporate a structured inference module into the non-autoregressive models. Specifically, we design an efficient approximation for Conditional Random Fields (CRF) for non-autoregressive sequence models, and further propose a dynamic transition technique to model positional contexts in the CRF. Experiments in machine translation show that while increasing little latency (8~14ms), our model could achieve significantly better translation performance than previous non-autoregressive models on different translation datasets. In particular, for the WMT14 En-De dataset, our model obtains a BLEU score of 26.80, which largely outperforms the previous non-autoregressive baselines and is only 0.61 lower in BLEU than purely autoregressive models.

arxiv preprint arxiv, machine translation, translation, (11 more...)

arXiv.org Machine Learning

1910.11555

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada (0.04)
Asia (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Indic Language Computing

Communications of the ACMOct-24-2019, 23:40:39 GMT

In April 2019, following the Easter Sunday bomb attacks, the Government of Sri Lanka had to shut down Facebook and YouTube for nine days to stop the spreading of hate speech and false news, posted mainly in the local languages Sinhala and Tamil. This came about simply because these social media platforms did not have the capability to detect and warn about the provocative content. India's Ministry of Human Resource Development (MHRD) wants lectures on Swayama and NPTELb--the online teaching platforms--to be translated into all Indian languages. Approximately 2.5 million students use the Swayam lectures on computer science alone. The lectures are in English, which students find difficult to understand.

indic language, machine translation, translation, (12 more...)

Communications of the ACM

AI-Alerts: 2019 > 2019-10 > AAAI AI-Alert for Oct 29, 2019 (1.00)

Country:

Asia > Pakistan (0.05)
Europe > United Kingdom > England > Greater Manchester > Manchester (0.04)
Asia > Sri Lanka > Western Province > Colombo > Colombo (0.04)
(6 more...)

Genre: Instructional Material (0.34)

Industry:

Government > Regional Government > Asia Government (0.35)
Media > News (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.68)

Add feedback

Artificial intelligence: is it a double-edged sword?

#artificialintelligenceOct-24-2019, 04:26:50 GMT

Artificial Intelligence (AI) is already reconfiguring the world in conspicuous ways. Data drives our global digital ecosystem, and AI technologies reveal patterns in data. Smartphones, smart homes, and smart cities influence how we live and interact, and AI systems are increasingly involved in recruitment decisions, medical diagnoses and judicial verdicts. Whether this scenario is utopian or dystopian depends on your perspective. The potential risks of AI are enumerated repeatedly.

hankook mirae technology, international automobile exhibition, robot, (11 more...)

#artificialintelligence

Country:

Asia > South Korea > Seoul > Seoul (0.06)
Asia > China > Guangdong Province > Guangzhou (0.06)
Europe > Russia (0.05)
(7 more...)

Industry:

Government (0.96)
Information Technology > Robotics & Automation (0.71)
Leisure & Entertainment > Games > Computer Games (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.91)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.49)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.34)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.33)

Add feedback

Diversifying Topic-Coherent Response Generation for Natural Multi-turn Conversations

Hu, Fei, Liu, Wei, Mian, Ajmal Saeed, Li, Li

arXiv.org Artificial IntelligenceOct-24-2019

Although response generation (RG) diversification for single-turn dialogs has been well developed, it is less investigated for natural multi-turn conversations. Besides, past work focused on diversifying responses without considering topic coherence to the context, producing uninformative replies. In this paper, we propose the Topic-coherent Hierarchical Recurrent Encoder-Decoder model (THRED) to diversify the generated responses without deviating the contextual topics for multi-turn conversations. In overall, we build a sequence-to-sequence net (Seq2Seq) to model multi-turn conversations. And then we resort to the latent Variable Hierarchical Recurrent Encoder-Decoder model (VHRED) to learn global contextual distribution of dialogs. Besides, we construct a dense topic matrix which implies word-level correlations of the conversation corpora. The topic matrix is used to learn local topic distribution of the contextual utterances. By incorporating both the global contextual distribution and the local topic distribution, THRED produces both diversified and topic-coherent replies. In addition, we propose an explicit metric (\emph{TopicDiv}) to measure the topic divergence between the post and generated response, and we also propose an overall metric combining the diversification metric (\emph{Distinct}) and \emph{TopicDiv}. We evaluate our model comparing with three baselines (Seq2Seq, HRED and VHRED) on two real-world corpora, respectively, and demonstrate its outstanding performance in both diversification and topic coherence.

arxiv preprint arxiv, diversification, utterance, (13 more...)

arXiv.org Artificial Intelligence

1910.11161

Country:

Oceania > Australia > Western Australia (0.04)
Asia > China > Chongqing Province > Chongqing (0.04)
Europe > Netherlands (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)

Add feedback

IBM Brings AI Retrosynthetic Analysis to the Cloud IBM Research Blog

#artificialintelligenceOct-23-2019, 14:30:54 GMT

The future of computing is one of the strongest transformational forces on our planet. Everything we touch has built-in computing capabilities and is generating tremendous volumes of data. The impact is not only speeding up our daily lives, but also more traditional industrial sectors, including chemistry. Last year at the ACS Fall Meeting 2018 in Boston, IBM Research released IBM RXN for Chemistry, a cloud-based app that takes the idea of relating organic chemistry to a language. The magic behind the app is a state-of-the-art neural machine translation method, which can predict the most likely outcome of a chemical reaction using sequence-to-sequence (seq2seq) models.

chemical reaction, ibm rxn, retrosynthesis, (10 more...)

#artificialintelligence

Country:

Europe > Switzerland > Zürich > Zürich (0.07)
Europe > Italy > Tuscany > Pisa Province > Pisa (0.06)

Industry: Information Technology (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

What is Image Annotation? – An Intro to 5 Image Annotation Services

#artificialintelligenceOct-22-2019, 15:37:51 GMT

Image annotation is one of the most important tasks in computer vision. With numerous applications, computer vision essentially strives to give a machine eyes – the ability to see and interpret the world. At times, machine learning projects seem to unlock futuristic technology we never thought possible. AI-powered applications like augmented reality, automatic speech recognition, and neural machine translation have the potential to change lives and businesses around the world. Likewise, the technologies that computer vision can give us (autonomous vehicles, facial recognition, unmanned drones) are extraordinary.

annotate, annotation, annotator, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.90)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.55)

Add feedback

How can AI Automate End-to-End Data Science?

Aggarwal, Charu, Bouneffouf, Djallel, Samulowitz, Horst, Buesser, Beat, Hoang, Thanh, Khurana, Udayan, Liu, Sijia, Pedapati, Tejaswini, Ram, Parikshit, Rawat, Ambrish, Wistuba, Martin, Gray, Alexander

arXiv.org Artificial IntelligenceOct-22-2019

Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it. This makes data science time consuming and restricted to experts with the resulting quality heavily dependent on their experience and skills. To make data science more accessible and scalable, we need its democratization. Automated Data Science (AutoDS) is aimed towards that goal and is emerging as an important research and business topic. We introduce and define the AutoDS challenge, followed by a proposal of a general AutoDS framework that covers existing approaches but also provides guidance for the development of new methods. We categorize and review the existing literature from multiple aspects of the problem setup and employed techniques. Then we provide several views on how AI could succeed in automating end-to-end AutoDS. We hope this survey can serve as insightful guideline for the AutoDS field and provide inspiration for future research.

architecture, data science, learning, (15 more...)

arXiv.org Artificial Intelligence

1910.14436

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Asia > Macao (0.04)
Asia > China (0.04)
(5 more...)

Genre:

Research Report (0.40)
Overview (0.34)

Industry:

Leisure & Entertainment > Games (0.68)
Health & Medicine (0.68)

Technology:

Information Technology > Data Science > Data Integration (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
(3 more...)

Add feedback

AI could be a force for positive social change – but we're currently heading for a darker future

#artificialintelligenceOct-20-2019, 11:03:32 GMT

Artificial intelligence (AI) is already re-configuring the world in conspicuous ways. Data drives our global digital ecosystem, and AI technologies reveal patterns in data. Smartphones, smart homes, and smart cities influence how we live and interact, and AI systems are increasingly involved in recruitment decisions, medical diagnoses, and judicial verdicts. Whether this scenario is utopian or dystopian depends on your perspective. The potential risks of AI are enumerated repeatedly.

ai system, training data, translation system, (5 more...)

#artificialintelligence

Industry: Information Technology (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.98)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.57)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.35)

Add feedback

Byte-Pair Encoding for Text-to-SQL Generation

Müller, Samuel, Vlachos, Andreas

arXiv.org Machine LearningOct-20-2019

Neural sequence-to-sequence models provide a competitive approach to the task of mapping a question in natural language to an SQL query, also referred to as text-to-SQL generation. The Byte-Pair Encoding algorithm (BPE) has previously been used to improve machine translation (MT) between natural languages. In this work, we adapt BPE for text-to-SQL generation. As the datasets for this task are rather small compared to MT, we present a novel stopping criterion that prevents overfitting the BPE encoding to the training set. Additionally, we present AST BPE, which is a version of BPE that uses the Abstract Syntax Tree (AST) of the SQL statement to guide BPE merges and therefore produce BPE encodings that generalize better. W e improved the accuracy of a strong attentive seq2seq baseline on five out of six English text-to-SQL tasks while reducing training time by more than 50% on four of them due to the shortened targets. Finally, on two of these tasks we exceeded previously reported accuracies.

bpe, dataset, query, (15 more...)

arXiv.org Machine Learning

1910.08962

Country:

North America > United States > Alabama (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Geneva > Geneva (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback