AITopics

doi: 10.1609/aaai.v34i05.6448

1912.05134

Country:

Asia > Macao (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceDec-11-2019

MetaMT,a MetaLearning Method Leveraging Multiple Domain Data for Low Resource Machine Translation

Li, Rumeng, Wang, Xun, Yu, Hong

Manipulating training data leads to robust neural models for MT.

dataset, machine translation, translation, (14 more...)

doi: 10.1609/aaai.v34i05.6339

1912.05467

Country:

North America > United States > Massachusetts > Middlesex County > Lowell (0.14)
North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
Asia > Japan (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Government (0.68)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

#artificialintelligenceDec-10-2019, 17:59:41 GMT

The quest for better training data

American localization specialist Lionbridge Technologies has been employing machine translation tools for many years. Eventually, its customers started asking for multilingual training data. Today, Lionbridge has a separate division entirely dedicated to AI, doing everything from collection of chatbot training data to image annotation, audio transcription and even multilingual content moderation services. To find out more about the work of the division, AI Business talked to Aristotelis Kostopoulos, vice president of product solutions, artificial intelligence at Lionbridge. Q: The AI division at Lionbridge grew out of the machine translation business, but today it does so much more.

learning, lionbridge, training data, (14 more...)

AI-Alerts: 2019 > 2019-12 > AAAI AI-Alert for Dec 17, 2019 (1.00)

Country: North America > United States > New York (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.76)

#artificialintelligenceDec-10-2019, 16:13:46 GMT

Conversations at High Altitude - Inside GTS Amsterdam - Welocalize

At a height of 100m up the amazing A'DAM Tower in central Amsterdam, the altitude wasn't a problem at Global Transformation Summit (GTS) but keeping up with the many shared experiences and fast exchange of ideas was! GTS Amsterdam brought together global brands, connecting international business leaders and senior marketing and localization professionals. What was the common ground? Many insights shared and new contacts made. As content types and volumes continue to increase – the growth of content on the internet doubles every 18 months – brands need to converge content, collaborate internally, and ensure the customer experience is consistent and personal, to stand out from online competition. This means re-imagining how we work – looking to define how multilingual content performs beyond traditional KPIs.

gt amsterdam, high altitude, information, (6 more...)

Country: Europe > Netherlands > North Holland > Amsterdam (0.86)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.33)

#artificialintelligenceDec-10-2019, 00:13:47 GMT

NVIDIA/OpenSeq2Seq

OpenSeq2Seq main goal is to allow researchers to most effectively explore various sequence-to-sequence models. The efficiency is achieved by fully supporting distributed and mixed-precision training. OpenSeq2Seq is built using TensorFlow and provides all the necessary building blocks for training encoder-decoder models for neural machine translation, automatic speech recognition, speech synthesis, and language modeling. Speech-to-text workflow uses some parts of Mozilla DeepSpeech project. Beam search decoder with language model re-scoring implementation (in decoders) is based on Baidu DeepSpeech.

nvidia openseq2seq, openseq2seq, workflow use, (1 more...)

Industry: Information Technology > Hardware (0.56)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

#artificialintelligenceDec-9-2019, 16:38:56 GMT

Your Brief Guide to Natural Language Processing (Part 1)

In recent years, natural language processing (NLP) has become a part of our everyday lives. Smartphones now come equipped with NLP-powered voice assistants that interpret and understand human speech in order to provide relevant responses to user queries. NLP also helps translation apps break down communication barriers by analyzing input in one language and transforming it into another language. Even word processors rely on NLP to check the grammar, logic, and syntax of written input. And NLP is now an integral part of customer service; it's used to guide people to the right representative through verbal commands. Yet, few people actually understand how NLP plays a role in making them possible.

artificial intelligence, machine translation, nlp, (13 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.31)

#artificialintelligenceDec-9-2019, 07:49:28 GMT

Is it a Good Idea to Trust Machine Translations? - Globalja

Is it a Good Idea to Trust Machine Translations? Although machine translation (MT) has become extremely popular in recent years, it still has a long way to go before it can substitute a human translator. Does it mean that you shouldn t use machine translation?

globalja, good idea, trust machine translation

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Saparina, Irina, Osokin, Anton

Cost-Sensitive Training for Autoregressive Models

arXiv.org Machine LearningDec-8-2019

Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daum\'e III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the standard KL loss only learns several high-probability tokens and can be replaced with ranking objectives that target these tokens explicitly.

machine translation, neural machine translation, reference policy, (14 more...)

arXiv.org Machine Learning

1912.03771

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)

Genre:

Research Report (0.82)
Workflow (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceDec-6-2019

Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation

Arivazhagan, Naveen, Cherry, Colin, I, Te, Macherey, Wolfgang, Baljekar, Pallavi, Foster, George

We investigate the problem of simultaneous machine translation of long-form speech content. We target a continuous speech-to-text scenario, generating translated captions for a live audio feed, such as a lecture or play-by-play commentary. As this scenario allows for revisions to our incremental translations, we adopt a re-translation approach to simultaneous translation, where the source is repeatedly translated from scratch as it grows. This approach naturally exhibits very low latency and high final quality, but at the cost of incremental instability as the output is continuously refined. We experiment with a pipeline of industry-grade speech recognition and translation tools, augmented with simple inference heuristics to improve stability. We use TED Talks as a source of multilingual test data, developing our techniques on English-to-German spoken language translation. Our minimalist approach to simultaneous translation allows us to easily scale our final evaluation to six more target languages, dramatically improving incremental stability for all of them.

simultaneous translation, spoken language translation, translation, (13 more...)

1912.03393

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Italy > Tuscany > Florence (0.04)
Asia > Middle East > Jordan (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre: Research Report (0.40)

Industry:

Media > Radio (0.34)
Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Artificial IntelligenceDec-5-2019

Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions

Schmidhuber, Juergen

We transform reinforcement learning (RL) into a form of supervised learning (SL) by turning traditional RL on its head, calling this Upside Down RL (UDRL). Standard RL predicts rewards, while UDRL instead uses rewards as task-defining inputs, together with representations of time horizons and other computable functions of historic and desired future data. UDRL learns to interpret these input observations as commands, mapping them to actions (or action probabilities) through SL on past (possibly accidental) experience. UDRL generalizes to achieve high rewards or other goals, through input commands such as: get lots of reward within at most so much time! A separate paper [61] on first experiments with UDRL shows that even a pilot version of UDRL can outperform traditional baseline algorithms on certain challenging RL problems. We also introduce a related simple but general approach for teaching a robot to imitate humans. First videotape humans imitating the robot's current behaviors, then let the robot learn through SL to map the videos (as input commands) to these behaviors, then let it generalize and imitate videos of humans executing previously unknown behavior. This Imitate-Imitator concept may actually explain why biological evolution has resulted in parents who imitate the babbling of their babies.

schmidhuber, sequence, step 2, (15 more...)

1912.02875

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
(9 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.87)
(3 more...)