AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Cost-Sensitive Training for Autoregressive Models

Saparina, Irina, Osokin, Anton

arXiv.org Machine LearningDec-8-2019

Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daum\'e III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the standard KL loss only learns several high-probability tokens and can be replaced with ranking objectives that target these tokens explicitly.

machine translation, neural machine translation, reference policy, (14 more...)

arXiv.org Machine Learning

1912.03771

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)

Genre:

Research Report (0.82)
Workflow (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation

Arivazhagan, Naveen, Cherry, Colin, I, Te, Macherey, Wolfgang, Baljekar, Pallavi, Foster, George

arXiv.org Artificial IntelligenceDec-6-2019

We investigate the problem of simultaneous machine translation of long-form speech content. We target a continuous speech-to-text scenario, generating translated captions for a live audio feed, such as a lecture or play-by-play commentary. As this scenario allows for revisions to our incremental translations, we adopt a re-translation approach to simultaneous translation, where the source is repeatedly translated from scratch as it grows. This approach naturally exhibits very low latency and high final quality, but at the cost of incremental instability as the output is continuously refined. We experiment with a pipeline of industry-grade speech recognition and translation tools, augmented with simple inference heuristics to improve stability. We use TED Talks as a source of multilingual test data, developing our techniques on English-to-German spoken language translation. Our minimalist approach to simultaneous translation allows us to easily scale our final evaluation to six more target languages, dramatically improving incremental stability for all of them.

simultaneous translation, spoken language translation, translation, (13 more...)

arXiv.org Artificial Intelligence

1912.03393

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Italy > Tuscany > Florence (0.04)
Asia > Middle East > Jordan (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre: Research Report (0.40)

Industry:

Media > Radio (0.34)
Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions

Schmidhuber, Juergen

arXiv.org Artificial IntelligenceDec-5-2019

We transform reinforcement learning (RL) into a form of supervised learning (SL) by turning traditional RL on its head, calling this Upside Down RL (UDRL). Standard RL predicts rewards, while UDRL instead uses rewards as task-defining inputs, together with representations of time horizons and other computable functions of historic and desired future data. UDRL learns to interpret these input observations as commands, mapping them to actions (or action probabilities) through SL on past (possibly accidental) experience. UDRL generalizes to achieve high rewards or other goals, through input commands such as: get lots of reward within at most so much time! A separate paper [61] on first experiments with UDRL shows that even a pilot version of UDRL can outperform traditional baseline algorithms on certain challenging RL problems. We also introduce a related simple but general approach for teaching a robot to imitate humans. First videotape humans imitating the robot's current behaviors, then let the robot learn through SL to map the videos (as input commands) to these behaviors, then let it generalize and imitate videos of humans executing previously unknown behavior. This Imitate-Imitator concept may actually explain why biological evolution has resulted in parents who imitate the babbling of their babies.

schmidhuber, sequence, step 2, (15 more...)

arXiv.org Artificial Intelligence

1912.02875

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
(9 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.87)
(3 more...)

Add feedback

Cross-Language Aphasia Detection using Optimal Transport Domain Adaptation

Balagopalan, Aparna, Novikova, Jekaterina, McDermott, Matthew B. A., Nestor, Bret, Naumann, Tristan, Ghassemi, Marzyeh

arXiv.org Machine LearningDec-4-2019

Multi-language speech datasets are scarce and often have small sample sizes in the medical domain. Robust transfer of linguistic features across languages could improve rates of early diagnosis and therapy for speakers of low-resource languages when detecting health conditions from speech. We utilize out-of-domain, unpaired, single-speaker, healthy speech data for training multiple Optimal Transport (OT) domain adaptation systems. We learn mappings from other languages to English and detect aphasia from linguistic characteristics of speech, and show that OT domain adaptation improves aphasia detection over unilingual baselines for French (6% increased F1) and Mandarin (5% increased F1). Further, we show that adding aphasic data to the domain adaptation system significantly increases performance for both French and Mandarin, increasing the F1 scores further (10% and 8% increase in F1 scores for French and Mandarin, respectively, over unilingual baselines).

dataset, domain adaptation, mandarin, (13 more...)

arXiv.org Machine Learning

1912.0437

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.46)

Add feedback

The Shallowness of Google Translate

#artificialintelligenceDec-3-2019, 18:46:56 GMT

One Sunday, at one of our weekly salsa sessions, my friend Frank brought along a Danish guest. I knew Frank spoke Danish well, since his mother was Danish, and he, as a child, had lived in Denmark. As for his friend, her English was fluent, as is standard for Scandinavians. However, to my surprise, during the evening's chitchat it emerged that the two friends habitually exchanged emails using Google Translate. Frank would write a message in English, then run it through Google Translate to produce a new text in Danish; conversely, she would write a message in Danish, then let Google Translate anglicize it.

google translate, shallowness, translation, (3 more...)

#artificialintelligence

Country: Europe > Denmark (0.27)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Move over, Google Translate: Here come A.I. earbuds

#artificialintelligenceDec-2-2019, 01:57:39 GMT

Forget phrase books or even Google Translate. New translation devices are getting closer to replicating the fantasy of the Babel fish, which in the "Hitchhiker's Guide to the Galaxy" sits in one's ear and instantly translates any foreign language into the user's own. The WT2 Plus Ear to Ear AI Translator Earbuds from Timekettle are already available, while the over-the-ear "Ambassador" from Wavery Labs is scheduled for release this year. Both brands are wireless, and come with two earpieces that must be synced to a single smartphone connected to Wi-Fi or cellular data. These devices "bring us a bit closer to being able to travel to places in the world where people speak different languages and communicate smoothly with those who are living there," said Graham Neubig, an assistant professor at the Language Technologies Institute of Carnegie Mellon University and an expert in machine learning and natural language processing.

earbud, google translate, translation, (14 more...)

#artificialintelligence

Country:

North America > United States > Colorado > Boulder County > Boulder (0.05)
Europe > Switzerland > Zürich > Zürich (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > China > Guangdong Province > Shenzhen (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

AWS adds 22 new languages to Amazon Translate ZDNet

#artificialintelligenceNov-29-2019, 07:31:36 GMT

Amazon Translate, Amazon Web Service's real-time translation service, is getting an update with support for 22 new languages. The announcement comes a week ahead of the AWS re:Invent conference, where AWS will promote Translate and a slew of other AI-powered tools for its cloud customers. AWS on Monday also announced new services related to image recognition, voice-based UIs and IOT. What is AI? Everything you need to know about Artificial Intelligence Amazon Translate now supports a total of 54 languages and dialects, with 2,804 language pairs now supported. The neural machine translation service enables customers to easily translate information from one language to many.

amazon translate zdnet, customer, new language, (6 more...)

#artificialintelligence

Country: North America > United States (0.06)

Industry: Information Technology (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)

Add feedback

Samsung Research Centers Around the World Take First Place in Prestigious AI Challenges

#artificialintelligenceNov-29-2019, 02:27:28 GMT

Samsung Electronics' Global Research & Development (R&D) Centers play a key part in developing artificial intelligence (AI) capabilities for real-world usage. A credit to the work this advanced R&D branch of Samsung undertakes, both Samsung R&D Institute Poland and Samsung Research America AI Center have recently won two prestigious global challenges. This year, Samsung R&D Institute Poland won first place in two categories, the first being text-to-text translation from English to Czech and the second – an end-to-end system translating English speech into German text. For the text-to-text translation category, researchers worked to develop a model to translate the transcript of a spoken English-language TED Talk into Czech. Developing their winning model required the Samsung team to develop large, filtered corpora from which to work and generate as much synthetic data as possible.

artificial intelligence, machine translation, natural language, (10 more...)

#artificialintelligence

Country:

Europe > Poland (0.57)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.06)
Asia > South Korea > Seoul > Seoul (0.06)
Asia > China > Beijing > Beijing (0.06)

Industry: Semiconductors & Electronics (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.53)

Add feedback

DiscoTK: Using Discourse Structure for Machine Translation Evaluation

Joty, Shafiq, Guzman, Francisco, Marquez, Lluis, Nakov, Preslav

arXiv.org Artificial IntelligenceNov-28-2019

We present novel automatic metrics for machine translation evaluation that use discourse structure and convolution kernels to compare the discourse tree of an automatic translation with that of the human reference. We experiment with five transformations and augmentations of a base discourse tree representation based on the rhetorical structure theory, and we combine the kernel scores for each of them into a single score. Finally, we add other metrics from the ASIYA MT evaluation toolkit, and we tune the weights of the combination on actual human judgments. Experiments on the WMT12 and WMT13 metrics shared task datasets show correlation with human judgments that outperforms what the best systems that participated in these years achieved, both at the segment and at the system level.

metric, representation, translation, (16 more...)

arXiv.org Artificial Intelligence

1911.12547

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Czechia > Prague (0.05)
North America > United States > Maryland > Baltimore (0.04)
(10 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

22 New Languages And Variants, 6 New Regions For Amazon Translate Amazon Web Services

#artificialintelligenceNov-27-2019, 18:02:46 GMT

Just a few weeks ago, I told you about 7 new languages supported by Amazon Translate, our fully managed service for machine translation. Well, here I am again, announcing no less than 22 new languages and variants, as well as 6 additional AWS Regions where Translate is now available. Introducing 22 New Languages And Variants That's what I call an update! In addition to existing languages, Translate now supports: Afrikaans, Albanian, Amharic, Azerbaijani, Bengali, Bosnian, Bulgarian, Croatian, Dari, Estonian, Canadian French, Georgian, Hausa, Latvian, Pashto, Serbian, Slovak, Slovenian, Somali, Swahili, Tagalog, and Tamil. Congratulations if you can name all countries and regions of origin: I couldn't!

amazon translate, new language and variant, ruby translate, (11 more...)

#artificialintelligence

Country:

North America > United States > California (0.05)
Europe > Sweden > Stockholm > Stockholm (0.05)
Asia > China > Hong Kong (0.05)

Industry:

Retail > Online (0.40)
Information Technology > Services (0.40)

Technology:

Information Technology > Communications > Web (0.40)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.35)

Add feedback