Goto

Collaborating Authors

 Machine Translation


Can Technology Replace Human Interpreters?

#artificialintelligence

Over the few past years, the demand for real-time interpretation services has increased considerably. The globalisation of business can be considered a huge contributing factor for this phenomenon, as it has increased the opportunities for international trade and opened new markets for businesses all around the world. In order to be competitive and keep up with this increase in demand for interpreting services, developers have been working on technological solutions to meet the requirements for high-quality simultaneous interpretations, but can tech really replace humans when it comes to interpreting? Real-time translation systems include applications that can be installed on smartphones, computers, or other gadgets linked to the Internet. The words of the speaker are transcribed by a computer server, which analyses the content and selects the closest translation from a vast collection of phrase pairs in its database.


Unsolved Problems in AI โ€“ AI Roadmap Institute Blog

#artificialintelligence

AI-complete problems are ones likely to contain all or most of human-level general artificial intelligence. A few problems in this category are listed below. Open-domain dialog is the problem of conducting competently a dialog with a human when the subject of the discussion is not known in advance. The challenge includes language understanding, dialog pragmatics, and understanding the world. Versions of the tasks include spoken and written dialog.


Google's AI translation tool seems to have invented its own language

#artificialintelligence

Creating a computer system to translate multiple languages is complex. The people at Google who built it wanted to find out just how clever their system was. So they came up with a challenge. They taught the machine to translate English to Japanese and vice versa. Then they taught it to translate English to Korean and also the reverse translation.


Latent Sequence Decompositions

arXiv.org Machine Learning

Sequence-to-sequence models rely on a fixed decomposition of the target sequences into a sequence of tokens that may be words, word-pieces or characters. The choice of these tokens and the decomposition of the target sequences into a sequence of tokens is often static, and independent of the input, output data domains. This can potentially lead to a sub-optimal choice of token dictionaries, as the decomposition is not informed by the particular problem being solved. In this paper we present Latent Sequence Decompositions (LSD), a framework in which the decomposition of sequences into constituent tokens is learnt during the training of the model. The decomposition depends both on the input sequence and on the output sequence. In LSD, during training, the model samples decompositions incrementally, from left to right by locally sampling between valid extensions.


The mind-blowing AI announcement from Google that you probably missed.

#artificialintelligence

Since originally writing this article, many people with far more expertise in these fields than myself have indicated that, while impressive, what Google have achieved is evolutionary, not revolutionary. In the very least, it's fair to say that I'm guilty of anthropomorphising in parts of the text. I've left the article's content unchanged, because I think it's interesting to compare the gut reaction I had with the subsequent comments from experts in the field. I strongly encourage readers to browse the comments beneath the version of this piece published on Medium.com In the closing weeks of 2016, Google published an article which quietly sailed under most people's radar.


Microsoft Translator publicly releases speech translation corpus

#artificialintelligence

As part of an ongoing effort within Microsoft to improve the accuracy of artificial intelligence (AI) systems, Microsoft Translator is publicly releasing a set of data that includes multiple conversations between bilingual speakers who are speaking French, German and English. This corpus, which was produced by Microsoft using bilingual speakers, aims to create a standard by which people can measure how well their conversational speech translation systems work. It can serve as a standardized data set for testing bilingual conversational speech translation systems such as the Microsoft Translator live feature and Skype Translator. Christian Federmann, a senior program manager working with the Microsoft Translator team, said there aren't as many standardized data sets for testing bilingual conversational speech translation systems. "You need high-quality data in order to have high-quality testing," Federmann said.


Morphology Generation for Statistical Machine Translation using Deep Learning Techniques

arXiv.org Machine Learning

Morphology in unbalanced languages remains a big challenge in the context of machine translation. In this paper, we propose to de-couple machine translation from morphology generation in order to better deal with the problem. We investigate the morphology simplification with a reasonable trade-off between expected gain and generation complexity. For the Chinese-Spanish task, optimum morphological simplification is in gender and number. For this purpose, we design a new classification architecture which, compared to other standard machine learning techniques, obtains the best results. This proposed neural-based architecture consists of several layers: an embedding, a convolutional followed by a recurrent neural network and, finally, ends with sigmoid and softmax layers. We obtain classification results over 98% accuracy in gender classification, over 93% in number classification, and an overall translation improvement of 0.7 METEOR.


Translation Software in Enterprise

AITopics Original Links

In an ideal world, everyone would speak the same language or at least be able to understand other languages fluently. But we don't live in that ideal world, yet. We do, however, live, work, and interact in a global society, where effective communication with co-workers is vital, and machine translation software has become a must for any company that works on internationally. There are many types of machine-based translation software. The two types most talked about assist translators and those who can do the translation themselves.


Learning to Suggest Phrases

AAAI Conferences

Intelligent keyboards can support writing by suggesting content. Certain types of phrases, when offered as suggestions, may be systematically chosen more often than their frequency in a corpus of text would predict. In order to generate those types of suggestions, we collected a dataset of how human authors responded to suggestions offered to them during open-ended writing tasks. We present an offline strategy for evaluating suggestions that enables us to learn the parameters of an improved suggestion generation policy without the expense of collecting additional data under that policy. We validate the approach by simulation and on human data by demonstrating improvement in held-out suggestion acceptance rate. Our approach can be applied to other scenarios where what is typical is not necessarily what is desirable.


When Will AI Make Engrish a Thing of the Past? - Nanalyze

#artificialintelligence

We've talked before about the rapid advances being made in the area of speech recognition. It's only a matter of time before companies like Doppler Labs augment our hearing such that we're able to experience real-time language translation. You'll soon be able to call him on that but only if you feel with 100% certainty that the translation is accurate. Why wouldn't language translation be accurate you ask? The simple answer here is one word.