AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Ten Machine Learning Algorithms You Should Know to Become a Data Scientist - ParallelDots

#artificialintelligenceJan-18-2019, 13:07:26 GMT

Let's say I am given an Excel sheet with data about various fruits and I have to tell which look like Apples. What I will do is ask a question "Which fruits are red and round?" and divide all fruits which answer yes and no to the question. Now, All Red and Round fruits might not be apples and all apples won't be red and round. So I will ask a question "Which fruits have red or yellow color hints on them? " on red and round fruits and will ask "Which fruits are green and round?" on not red and round fruits. Based on these questions I can tell with considerable accuracy which are apples. This cascade of questions is what a decision tree is. However, this is a decision tree based on my intuition.

machine learning, natural language, reinforcement learning, (17 more...)

#artificialintelligence

Country: North America > United States (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.73)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

What is the current biggest hurdle for AI innovation? Gengo AI

#artificialintelligenceJan-18-2019, 09:37:04 GMT

In a previous article, I discussed the current pace of AI innovation. The shortage of available AI training data is a huge blocker in AI innovation today, leaving some businesses frustrated. In recent years, some media channels hyped up that AI technology will advance exponentially at lightning speed, but so far that has not happened. We don't have enough AI training data because companies often underestimate the amount of data they need, and the time to collect that data. The few companies invested in data collection often refuse to make their data public, usually due to privacy concerns or fear of losing to their competitors.

machine learning, natural language, translation, (13 more...)

#artificialintelligence

Industry: Information Technology (0.42)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Evaluating Text Output in NLP: BLEU at your own risk

#artificialintelligenceJan-16-2019, 13:43:38 GMT

One question I get fairly often from folks who are just getting into NLP is how to evaluate systems when the output of that system is text, rather than some sort of classification of the input text. These types of problems, where you put some text into your model and get some other text out of it, are known as sequence to sequence or string transduction problems. This sort of technology is right out of science fiction. With such a wide range of exciting applications, it's easy to see why sequence to sequence modeling is more popular than ever. What's not easy is actually evaluating these systems. Unfortunately for folks who are just getting started, there's no simple answer about what metric you should use to evaluate your model. Even worse, one of the most popular metrics for evaluating sequence to sequence tasks, BLEU, has major drawbacks, especially when applied to tasks that it was never intended to evaluate.

artificial intelligence, natural language, translation, (18 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.97)

Add feedback

Artificial Intelligence Is Changing The Translation Industry. But Will It Work?

#artificialintelligenceJan-14-2019, 00:41:45 GMT

Artificial intelligence (AI) has infiltrated numerous aspects of our lives in recent years, thanks to improvements in the field of machine learning, where computers ostensibly program themselves. This drive towards digital self-learning has led to major breakthroughs in our day-to-day interactions with machines, most notably the rise of digital home assistants such as Amazon Echo, and the recently launched Google Lens, which identifies objects based on visual cues from your phone's camera. One of the most widely-discussed advances has been the use of AI in translation. Not unlike the Babel Fish from The Hitchhiker's Guide to the Galaxy, with AI translation, "you can instantly understand anything said to you in any form of language." The technology works by recognizing words individually and then, as MIT Technology Review puts it, "takes advantage of the fact that relationships between certain words…are similar across languages" to create its translations. It has already found its way into a number of our most commonly used websites and platforms, with even grander plans in the pipeline – but just how reliable is the technology?

machine learning, natural language, translation, (12 more...)

#artificialintelligence

Industry: Information Technology (0.71)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.56)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.53)

Add feedback

XNet: GAN Latent Space Constraints

Sendik, Omry, Lischinski, Dani, CohenOr, Daniel

arXiv.org Machine LearningJan-14-2019

Recent GAN-based architectures have been able to deliver impressive performance on the general task of image-to-image translation. In particular, it was shown that a wide variety of image translation operators may be learned from two image sets, containing images from two different domains, without establishing an explicit pairing between the images. This was made possible by introducing clever regularizers to overcome the under-constrained nature of the unpaired translation problem. In this work, we introduce a novel architecture for unpaired image translation, and explore several new regularizers enabled by it. Specifically, our architecture comprises a pair of GANs, as well as a pair of translators between their respective latent spaces. These cross-translators enable us to impose several regularizing constraints on the learnt image translation operator, collectively referred to as latent cross-consistency. Our results show that our proposed architecture and latent cross-consistency constraints are able to outperform the existing state-of-the-art on a wide variety of image translation tasks.

architecture, latent space, translation, (15 more...)

arXiv.org Machine Learning

1901.0453

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)

Genre: Research Report > New Finding (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.48)

Add feedback

Google Translate will help Wikipedia fill its non-English websites

EngadgetJan-11-2019, 00:43:26 GMT

Google is helping the Wikimedia Foundation achieve its goal of making Wikipedia articles available in a lot more languages. The Foundation has added Google Translate to its content translation tool, which human editors can use to add content to non-English Wikipedia websites. Those editors can take advantage of the new option -- "one of the most advanced machine translation systems available today," the foundation called it -- to generate an initial translation that they can then review and edit for readability in their language. The Foundation says volunteer Wikipedia editors have been asking for Google Translate integration for a long time now. According to VentureBeat, this move is an expansion of an earlier partnership, wherein Google promised to help Wikipedia make its English posts more accessible in Indonesia.

artificial intelligence, google translate, natural language, (5 more...)

Engadget

Country: Asia > Indonesia (0.29)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Computational Register Analysis and Synthesis

Argamon, Shlomo Engelson

arXiv.org Artificial IntelligenceJan-8-2019

The study of register in computational language research has historically been divided into register analysis, seeking to determine the registerial character of a text or corpus, and register synthesis, seeking to generate a text in a desired register. This article surveys the different approaches to these disparate tasks. Register synthesis has tended to use more theoretically articulated notions of register and genre than analysis work, which often seeks to categorize on the basis of intuitive and somewhat incoherent notions of prelabeled 'text types'. I argue that an integration of computational register analysis and synthesis will benefit register studies as a whole, by enabling a new large-scale research program in register studies. It will enable comprehensive global mapping of functional language varieties in multiple languages, including the relationships between them. Furthermore, computational methods together with high coverage systematically collected and analyzed data will thus enable rigorous empirical validation and refinement of different theories of register, which will have also implications for our understanding of linguistic variation in general.

data mining, machine learning, variation, (24 more...)

arXiv.org Artificial Intelligence

1901.02543

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)
(15 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine (0.46)
Media > News (0.46)
Education (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(8 more...)

Add feedback

Ministry earmarks subsidies totaling ¥20 million to set up translation systems for foreign students at schools

The Japan TimesJan-7-2019, 05:08:56 GMT

The education ministry plans to set up a new subsidy system for prefectures and large cities that offer detailed support to foreign students attending public elementary and junior high schools and their parents through the use of multilingual translation systems. The subsidies will be offered to prefectural governments, ordinance-designated major cities and other core cities that use tablet computers with multilingual speech translation functions when teaching Japanese to students from abroad at school and providing school guidance to their parents. The ministry has set aside ¥20 million for the subsidy system, which is designed to cover one-third of related costs, under the government's fiscal 2019 budget. According to sources, 100 language support programs are likely to become eligible for the financial aid. The launch of the new subsidy system comes in line with the government's policy of allowing more foreign workers to enter the country.

artificial intelligence, machine translation, natural language, (5 more...)

The Japan Times

Industry: Education > Educational Setting (0.74)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.99)

Add feedback

Ministry earmarks subsidies totaling ¥20 million to set up translation systems for foreign students at schools

The Japan TimesJan-6-2019, 13:43:22 GMT

The education ministry plans to set up a new subsidy system for prefectures and large cities that offer detailed support to foreign students attending public elementary and junior high schools and their parents by using multilingual translation systems. The subsidies will be offered to prefectural governments, ordinance-designated major cities and other core cities that use tablet computers with multilingual speech translation functions in teaching Japanese to students from abroad at school and providing school guidance to their parents. The ministry has set aside ¥20 million for the subsidy system, which is designed to cover one-third of related costs, under the government's fiscal 2019 budget, with 100 language support programs likely to become eligible for the financial aid, informed sources said. The launch of the new subsidy system comes in line with the government's policy of allowing more foreign workers to come here. The number of foreign students in Japan needing Japanese language education totaled 43,947 in fiscal 2016, up 70 percent from 26,281 in fiscal 2006.

artificial intelligence, machine translation, natural language, (5 more...)

The Japan Times

Country: Asia > Japan (0.29)

Industry: Education > Educational Setting (0.74)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.99)

Add feedback

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Zhang, Chicheng, Agarwal, Alekh, Daumé, Hal III, Langford, John, Negahban, Sahand N

arXiv.org Machine LearningJan-2-2019

We investigate the feasibility of learning from both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be different between these two data sources. Theoretically, we state and prove no-regret algorithms for learning that is robust to divergences between the two sources. Empirically, we evaluate some of these algorithms on a large selection of datasets, showing that our approaches are feasible, and helpful in practice.

algorithm, cumulative distribution function, probability 1, (13 more...)

arXiv.org Machine Learning

1901.00301

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > Maryland (0.04)
(3 more...)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback