AITopics | Machine Translation

Collaborating Authors

Machine Translation

"Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another. One of the very earliest pursuits in computer science, MT has proved to be an elusive goal, but today a number of systems are available which produce output which, if not perfect, is of sufficient quality to be useful in a number of specific domains."
– Definition from the European Association for Machine Translation (EAMT).

You can translate text of your choice by using free translators such as: CAPITA, Google Translate, SDL International, SYSTRAN.

News Overviews Instructional Materials AI-Alerts Classics

Universal Approximation of Input-Output Maps by Temporal Convolutional Nets

Hanson, Joshua, Raginsky, Maxim

arXiv.org Machine LearningJun-21-2019

There has been a recent shift in sequence-to-sequence modeling from recurrent network architectures to convolutional network architectures due to computational advantages in training and operation while still achieving competitive performance. For systems having limited long-term temporal dependencies, the approximation capability of recurrent networks is essentially equivalent to that of temporal convolutional nets (TCNs). We prove that TCNs can approximate a large class of input-output maps having approximately finite memory to arbitrary error tolerance. Furthermore, we derive quantitative approximation rates for deep ReLU TCNs in terms of the width and depth of the network and modulus of continuity of the original input-output map, and apply these results to input-output maps of systems that admit finite-dimensional state-space realizations (i.e., recurrent models).

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1906.09211

Country: North America > United States > Illinois (0.28)

Genre: Research Report (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)

Add feedback

Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning

Addanki, Ravichandra, Venkatakrishnan, Shaileshh Bojja, Gupta, Shreyan, Mao, Hongzi, Alizadeh, Mohammad

arXiv.org Machine LearningJun-20-2019

We present Placeto, a reinforcement learning (RL) approach to efficiently find device placements for distributed neural network training. Unlike prior approaches that only find a device placement for a specific computation graph, Placeto can learn generalizable device placement policies that can be applied to any graph. We propose two key ideas in our approach: (1) we represent the policy as performing iterative placement improvements, rather than outputting a placement in one shot; (2) we use graph embeddings to capture relevant information about the structure of the computation graph, without relying on node labels for indexing. These ideas allow Placeto to train efficiently and generalize to unseen graphs. Our experiments show that Placeto requires up to 6.1x fewer training steps to find placements that are on par with or better than the best placements found by prior approaches. Moreover, Placeto is able to learn a generalizable placement policy for any given family of graphs, which can then be used without any retraining to predict optimized placements for unseen graphs from the same family. This eliminates the large overhead incurred by prior RL approaches whose lack of generalizability necessitates re-training from scratch every time a new graph is to be placed.

machine learning, natural language, reinforcement learning, (23 more...)

arXiv.org Machine Learning

1906.08879

Country:

North America > United States (0.68)
Europe (0.46)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)

Add feedback

The Challenge of Open Source MT SDL

#artificialintelligenceJun-19-2019, 18:14:22 GMT

The very large majority of open-source MT efforts fail because they do not consistently produce output that is equal to, or better than, any easily accessed public MT solution or because they cannot be deployed effectively. This is not to say that this is not possible, but the investments and long-term commitment required for success are often underestimated or simply not properly understood. A case can always be made for private systems that offer greater control and security, even if they are generally less accurate than public MT options. However, in the localization industry we see that if "free" MT solutions that are superior to an LSP-built system are available, translators will use them. We also find that for the few self-developed MT systems that do produce useful output quality, integration issues are often an impediment to deployment at enterprise scale and robustness. Some say that those who ignore the lessons of history are doomed to repeat errors.

artificial intelligence, natural language, open source mt sdl, (13 more...)

#artificialintelligence

Technology:

Information Technology > Software (0.66)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.43)

Add feedback

Machine Learning Testing: Survey, Landscapes and Horizons

Zhang, Jie M., Harman, Mark, Ma, Lei, Liu, Yang

arXiv.org Artificial IntelligenceJun-19-2019

This paper provides a comprehensive survey of Machine Learning Testing (ML testing) research. It covers 128 papers on testing properties (e.g., correctness, robustness, and fairness), testing components (e.g., the data, learning program, and framework), testing workflow (e.g., test generation and test evaluation), and application scenarios (e.g., autonomous driving, machine translation). The paper also analyses trends concerning datasets, research trends, and research focus, concluding with research challenges and promising research directions in ML testing.

machine learning, ml testing, natural language, (18 more...)

arXiv.org Artificial Intelligence

1906.10742

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.04)
Asia > China > Hong Kong (0.04)
(12 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.87)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Banking & Finance (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
(4 more...)

Add feedback

Unsupervised State Representation Learning in Atari

Anand, Ankesh, Racah, Evan, Ozair, Sherjil, Bengio, Yoshua, Côté, Marc-Alexandre, Hjelm, R Devon

arXiv.org Machine LearningJun-19-2019

State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporally distinct features of a neural encoder of the observations. We also introduce a new benchmark based on Atari 2600 games where we evaluate representations based on how well they capture the ground truth state variables. We believe this new framework for evaluating representation learning models will be crucial for future representation learning research. Finally, we compare our technique with other state-of-the-art generative and contrastive representation learning methods.

information, international conference, representation, (14 more...)

arXiv.org Machine Learning

1906.08226

Country:

North America > Canada > Quebec > Montreal (0.05)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry:

Leisure & Entertainment > Games > Computer Games (0.46)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback

The Challenge of Open Source Machine Translation

#artificialintelligenceJun-18-2019, 16:41:58 GMT

We live in a time when there is a proliferation of open-source machine learning and AI-related development platforms. Thus, people believe that given a large amount of data and a few computers, a functional and useful MT system can be developed with a do-it-yourself (DIY) tool kit. However, as many who have tried have found out, the reality is much more complicated, and the path to success is long, winding and sometimes even treacherous. The very large majority of open-source MT efforts fail because they do not consistently produce output that is equal to, or better than, any easily accessed public MT solution or because they cannot be deployed effectively. This is not to say that this is not possible, but the investments and long-term commitment required for success are often underestimated or simply not properly understood. A case can always be made for private systems that offer greater control and security, even if they are generally less accurate than public MT options.

artificial intelligence, machine learning, natural language, (5 more...)

#artificialintelligence

Technology:

Information Technology > Software (0.88)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.43)
Information Technology > Artificial Intelligence > Machine Learning (0.42)
Information Technology > Communications > Social Media (0.40)

Add feedback

Misleading Failures of Partial-input Baselines

Feng, Shi, Wallace, Eric, Boyd-Graber, Jordan

arXiv.org Artificial IntelligenceJun-18-2019

Recent work establishes dataset difficulty and removes annotation artifacts via partial-input baselines (e.g., hypothesis-only models for SNLI or question-only models for VQA). When a partial-input baseline gets high accuracy, a dataset is cheatable. However, the converse is not necessarily true: the failure of a partial-input baseline does not mean a dataset is free of artifacts. To illustrate this, we first design artificial datasets which contain trivial patterns in the full input that are undetectable by any partial-input model. Next, we identify such artifacts in the SNLI dataset - a hypothesis-only model augmented with trivial patterns in the premise can solve 15% of the examples that are previously considered "hard". Our work provides a caveat for the use of partial-input baselines for dataset verification and creation.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1905.05778

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)

Add feedback

Bridging the Gap between Training and Inference for Neural Machine Translation

Zhang, Wen, Feng, Yang, Meng, Fandong, You, Di, Liu, Qun

arXiv.org Machine LearningJun-17-2019

Neural Machine Translation (NMT) generates target words sequentially in the way of predicting the next word conditioned on the context words. At training time, it predicts with the ground truth words as context while at inference it has to generate the entire sequence from scratch. This discrepancy of the fed context leads to error accumulation among the way. Furthermore, word-level training requires strict matching between the generated sequence and the ground truth sequence which leads to overcorrection over different but reasonable translations. In this paper, we address these issues by sampling context words not only from the ground truth sequence but also from the predicted sequence by the model during training, where the predicted sequence is selected with a sentence-level optimum. Experiment results on Chinese->English and WMT'14 English->German translation tasks demonstrate that our approach can achieve significant improvements on multiple datasets.

artificial intelligence, natural language, translation, (16 more...)

arXiv.org Machine Learning

1906.02448

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Michigan (0.04)
North America > United States > Massachusetts > Worcester County > Worcester (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Benchmarking Neural Machine Translation for Southern African Languages

Martinus, Laura, Abbott, Jade Z.

arXiv.org Machine LearningJun-17-2019

Unlike major Western languages, most African languages are very low-resourced. Furthermore, the resources that do exist are often scattered and difficult to obtain and discover. As a result, the data and code for existing research has rarely been shared. This has lead a struggle to reproduce reported results, and few publicly available benchmarks for African machine translation models exist. To start to address these problems, we trained neural machine translation models for 5 Southern African languages on publicly-available datasets. Code is provided for training the models and evaluate the models on a newly released evaluation set, with the aim of spur future research in the field for Southern African languages.

artificial intelligence, machine translation, natural language, (14 more...)

arXiv.org Machine Learning

1906.10511

Country: Africa (0.22)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Multilingual translation tools spread in Japan with new visa system

The Japan TimesJun-16-2019, 10:31:55 GMT

The use of multilingual translation tools is expanding in Japan, where foreign workers are expected to increase in the wake of April's launch of new visa categories. A growing number of local governments, labor unions and other entities have decided to introduce translation tools, which can help foreigners when going through administrative procedures as they allow local officials and other officers to talk to such applicants in their mother languages. "Talking in the applicants' own languages makes it easier to convey our cooperative stance," said an official in Tokyo's Sumida Ward. The ward introduced VoiceBiz, an audio translation app developed by Toppan Printing Co. that covers 30 languages. The app, which can be downloaded onto smartphones and tablet computers, will be used in eight municipalities, including Osaka and Ayase in Kanagawa Prefecture, company officials said.

artificial intelligence, machine translation, natural language, (9 more...)

The Japan Times

AI-Alerts: 2019 > 2019-06 > AAAI AI-Alert for Jun 18, 2019 (1.00)

Country:

Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.60)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.27)
Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.27)
Asia > Japan > Shikoku > Tokushima Prefecture > Tokushima (0.10)

Industry: Government (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.92)

Add feedback