AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Latent Translation: Crossing Modalities by Bridging Generative Models

Tian, Yingtao, Engel, Jesse

arXiv.org Machine LearningFeb-21-2019

End-to-end optimization has achieved state-of-the-art performance on many specific problems, but there is no straight-forward way to combine pretrained models for new problems. Here, we explore improving modularity by learning a post-hoc interface between two existing models to solve a new task. Specifically, we take inspiration from neural machine translation, and cast the challenging problem of cross-modal domain transfer as unsupervised translation between the latent spaces of pretrained deep generative models. By abstracting away the data representation, we demonstrate that it is possible to transfer across different modalities (e.g., image-to-audio) and even different types of generative models (e.g., VAE-to-GAN). We compare to state-of-the-art techniques and find that a straight-forward variational autoencoder is able to best bridge the two generative models through learning a shared latent space. We can further impose supervised alignment of attributes in both domains with a classifier in the shared latent space. Through qualitative and quantitative evaluations, we demonstrate that locality and semantic alignment are preserved through the transfer process, as indicated by high transfer accuracies and smooth interpolations within a class. Finally, we show this modular structure speeds up training of new interface models by several orders of magnitude by decoupling it from expensive retraining of base generative models.

crossing modality, latent space, latent translation, (14 more...)

arXiv.org Machine Learning

1902.08261

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > Suffolk County > Stony Brook (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(2 more...)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Shen, Jonathan, Nguyen, Patrick, Wu, Yonghui, Chen, Zhifeng, Chen, Mia X., Jia, Ye, Kannan, Anjuli, Sainath, Tara, Cao, Yuan, Chiu, Chung-Cheng, He, Yanzhang, Chorowski, Jan, Hinsu, Smit, Laurenzo, Stella, Qin, James, Firat, Orhan, Macherey, Wolfgang, Gupta, Suyog, Bapna, Ankur, Zhang, Shuyuan, Pang, Ruoming, Weiss, Ron J., Prabhavalkar, Rohit, Liang, Qiao, Jacob, Benoit, Liang, Bowen, Lee, HyoukJoong, Chelba, Ciprian, Jean, Sébastien, Li, Bo, Johnson, Melvin, Anil, Rohan, Tibrewal, Rajat, Liu, Xiaobing, Eriguchi, Akiko, Jaitly, Navdeep, Ari, Naveen, Cherry, Colin, Haghani, Parisa, Good, Otavio, Cheng, Youlong, Alvarez, Raziel, Caswell, Isaac, Hsu, Wei-Ning, Yang, Zongheng, Wang, Kuan-Chieh, Gonina, Ekaterina, Tomanek, Katrin, Vanik, Ben, Wu, Zelin, Jones, Llion, Schuster, Mike, Huang, Yanping, Chen, Dehao, Irie, Kazuki, Foster, George, Richardson, John, Macherey, Klaus, Bruguier, Antoine, Zen, Heiga, Raffel, Colin, Kumar, Shankar, Rao, Kanishka, Rybach, David, Murray, Matthew, Peddinti, Vijayaditya, Krikun, Maxim, Bacchiani, Michiel A. U., Jablin, Thomas B., Suderman, Rob, Williams, Ian, Lee, Benjamin, Bhatia, Deepti, Carlson, Justin, Yavuz, Semih, Zhang, Yu, McGraw, Ian, Galkin, Max, Ge, Qi, Pundak, Golan, Whipkey, Chad, Wang, Todd, Alon, Uri, Lepikhin, Dmitry, Tian, Ye, Sabour, Sara, Chan, William, Toshniwal, Shubham, Liao, Baohua, Nirschl, Michael, Rondon, Pat

arXiv.org Machine LearningFeb-21-2019

Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly within the framework, and it contains existing implementations of a large number of utilities, helper functions, and the newest research ideas. Lingvo has been used in collaboration by dozens of researchers in more than 20 papers over the last two years. This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the framework.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

1902.08295

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

Add feedback

What Microsoft and Google Are Not Telling You About Their A.I.

#artificialintelligenceFeb-19-2019, 16:11:02 GMT

In September of 2018, iFlytek, a Chinese technology company and world leader in A.I. -- particularly in voice recognition software -- was accused of disguising human translation as machine translation during a tech conference in Shanghai. The whistleblower was an interpreter, Bell Wang, who was doing live translation at the conference. He noticed that iFlytek was using his translations as live subtitles on a screen next to the company's brand logo. This gave the appearance that the translated output was produced by their A.I. system, rather than by Wang. The company was also broadcasting the translations live online using a computer-synthesized voice, instead of the original human interpreters' voices.

iflytek, microsoft and google, translation, (3 more...)

#artificialintelligence

Country:

Asia > China > Shanghai > Shanghai (0.27)
North America > United States > California (0.07)
Asia > China > Guangdong Province > Shenzhen (0.07)

Industry:

Information Technology (1.00)
Banking & Finance > Trading (0.36)

Technology:

Information Technology > Artificial Intelligence > Speech (0.87)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.59)

Add feedback

The future of content is autonomous London Business News Londonlovesbusiness.com

#artificialintelligenceFeb-19-2019, 14:27:23 GMT

SDL a global leader in content creation, translation and delivery, today calls on brands to rethink current content strategies, and prepare for a digital future where content supply chains are autonomous, machine-first and human optimized, for greater impact with worldwide audiences, across any language and device. Companies are struggling to handle the growing volume and velocity of content required to engage with global audiences. And it's expected to get worse: 93% say the content they produce will increase in the next two years. SDL's Enabling the Future of Content report addresses these challenges, offering insights on how companies can move towards an autonomous content supply chain of the future, capable of delivering any type of content to global audiences. Peggy Chen, CMO, SDL said, "Engaging with customers globally requires content, and lots of it.

autonomous london business new londonlovesbusiness, content supply chain, supply chain, (9 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.33)

Add feedback

Semantic Neural Machine Translation using AMR

Song, Linfeng, Gildea, Daniel, Zhang, Yue, Wang, Zhiguo, Su, Jinsong

arXiv.org Artificial IntelligenceFeb-19-2019

It is intuitive that semantic representations can be useful for machine translation, mainly because they can help in enforcing meaning preservation and handling data sparsity (many sentences correspond to one meaning) of machine translation models. On the other hand, little work has been done on leveraging semantics for neural machine translation (NMT). In this work, we study the usefulness of AMR (short for abstract meaning representation) on NMT. Experiments on a standard English-to-German dataset show that incorporating AMR as additional knowledge can significantly improve a strong attention-based sequence-to-sequence neural translation model.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1902.07282

Country:

Asia > China > Fujian Province > Xiamen (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

A spelling correction model for end-to-end speech recognition

Guo, Jinxi, Sainath, Tara N., Weiss, Ron J.

arXiv.org Artificial IntelligenceFeb-19-2019

Attention-based sequence-to-sequence models for speech recognition jointly train an acoustic model, language model (LM), and alignment mechanism using a single neural network and require only parallel audio-text pairs. Thus, the language model component of the end-to-end model is only trained on transcribed audio-text pairs, which leads to performance degradation especially on rare words. While there have been a variety of work that look at incorporating an external LM trained on text-only data into the end-to-end framework, none of them have taken into account the characteristic error distribution made by the model. In this paper, we propose a novel approach to utilizing text-only data, by training a spelling correction (SC) model to explicitly correct those errors. On the LibriSpeech dataset, we demonstrate that the proposed model results in an 18.6% relative improvement in WER over the baseline model when directly correcting top ASR hypothesis, and a 29.0% relative improvement when further rescoring an expanded n-best list using an external LM.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1902.07178

Country: North America > United States > California > Los Angeles County > Los Angeles (0.14)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.88)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Amazon, Google, Microsoft Press Further into Customized Language Tech and Services Slator

#artificialintelligenceFeb-18-2019, 15:02:45 GMT

Companies such as Amazon, Google, Microsoft, and many others have rapidly expanded their machine learning offerings and now increasingly encroach on the heart of language services. Take Bridgeman Images, for example. Bridgeman is a "specialist in the distribution of fine art, cultural and historical media for reproduction" -- the Getty Images of the art world, if you will. According to an Amazon case study published on February 6, 2019, the company needed automated translation to localize into many languages at scale. They opted for Amazon Web Services' Amazon Translate to localize "570 million English characters into Italian, French, German, and Spanish" over the course of 15 days.

amazon, google, translation, (11 more...)

#artificialintelligence

Country: North America > United States (0.06)

Industry: Information Technology > Services (0.36)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Is the era of artificial speech translation upon us?

The GuardianFeb-17-2019, 10:50:56 GMT

Noise, Alex Waibel tells me, is one of the major challenges that artificial speech translation has to meet. A device may be able to recognise speech in a laboratory, or a meeting room, but will struggle to cope with the kind of background noise I can hear surrounding Professor Waibel as he speaks to me from Kyoto station. I'm struggling to follow him in English, on a scratchy line that reminds me we are nearly 10,000km apart – and that distance is still an obstacle to communication even if you're speaking the same language. We haven't reached the future yet. If we had, Waibel would have been able to speak in his native German and I would have been able to hear his words in English.

artificial speech translation, translation, translator, (15 more...)

The Guardian

Country:

Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.25)
North America > United States > Massachusetts (0.05)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.05)
Asia > China (0.05)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Google Translate is a manifestation of Wittgenstein's theory of language

#artificialintelligenceFeb-17-2019, 06:31:44 GMT

More than 60 years after philosopher Ludwig Wittgenstein's theories on language were published, the artificial intelligence behind Google Translate has provided a practical example of his hypotheses. Patrick Hebron, who works on machine learning in design at Adobe and studied philosophy with Wittgenstein expert Garry Hagberg for his bachelor's degree at Bard College, notes that the networks behind Google Translate are a very literal representation of Wittgenstein's work. Google employees have previously acknowledged that Wittgenstein's theories gave them a breakthrough in making their translation services more effective, but somehow, this key connection between philosophy of language and artificial intelligence has long gone under-celebrated and overlooked. The translation service relies on an algorithm created by Google employees called word2vec, which creates "vector representations" for words, which essentially means that each word is represented numerically. For the translations to work, programmers have to then create a "neural network," a form of machine learning, that's trained to understand how these words relate to each other.

google translate, representation, wittgenstein, (13 more...)

#artificialintelligence

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)
Asia > Russia (0.05)
Asia > China > Beijing > Beijing (0.05)

Genre: Research Report (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.76)

Add feedback

Neural Machine Translation with Sequence to Sequence RNN - DATAVERSITY

#artificialintelligenceFeb-15-2019, 20:00:47 GMT

Click to learn more about author Rosaria Silipo. Automatic machine translation has been a popular subject for machine learning algorithms. After all, if machines can detect topics and understand texts, translation should be just the next step. Machine translation can be seen as a variation of natural language generation. In a previous project, we worked on the automatic generation of fairy tales (see "Once upon a Time … by LSTM Network").

node, sequence, translation, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback