AITopics

doi: 10.1613/jair.4761

AI Access Foundation

10974

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Czechia > South Moravian Region > Brno (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)
(31 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Government (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Costa-jussà, Marta R., Bangalore, Srinivas, Lambert, Patrik, Màrquez, Lluís, Montiel-Ponsoda, Elena

Introduction to the Special Issue on Cross-Language Algorithms and Applications

Journal of Artificial Intelligence ResearchJan-12-2016

With the increasingly global nature of our everyday interactions, the need for multilin- gual technologies to support efficient and effective information access and communication cannot be overemphasized. Computational modeling of language has been the focus of Natural Language Processing, a subdiscipline of Artificial Intelligence. One of the current challenges for this discipline is to design methodologies and algorithms that are cross- language in order to create multilingual technologies rapidly. The goal of this JAIR special issue on Cross-Language Algorithms and Applications (CLAA) is to present leading re- search in this area, with emphasis on developing unifying themes that could lead to the development of the science of multi- and cross-lingualism. In this introduction, we provide the reader with the motivation for this special issue and summarize the contributions of the papers that have been included. The selected papers cover a broad range of cross-lingual technologies including machine translation, domain and language adaptation for sentiment analysis, cross-language lexical resources, dependency parsing, information retrieval and knowledge representation. We anticipate that this special issue will serve as an invaluable resource for researchers interested in topics of cross-lingual natural language processing.

machine translation, proceedings, translation, (12 more...)

doi: 10.1613/jair.5022

AI Access Foundation

10973

Country:

Asia > India > Karnataka > Bengaluru (0.05)
Europe > Czechia > Prague (0.04)
Europe > Sweden > Uppsala County > Uppsala (0.04)
(18 more...)

Genre:

Overview (0.87)
Collection > Journal > Special Issue (0.77)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
(2 more...)

Wołk, Krzysztof, Koržinek, Danijel

Comparison and Adaptation of Automatic Evaluation Metrics for Quality Assessment of Re-Speaking

arXiv.org Machine LearningJan-12-2016

One of the main driving forces in Speech Technology, for the last several years, comes from the efforts of various groups and organizations tackling with the issue of disability, specifically deaf and hard of hearing people. Most notably, a long term effort by such organisations has lead to a plan by the European Commision to enable "Subtitling of 100% of programs in public TV all over the EU by 2020 with simple technical standards and consumer friendly rules" [15]. This ambitious task would not be possible to achieve without the aid of Speech Technology. While there has been a considerable improvement of quality of Automatic Speech Recognition (ASR) technology recently, many of the tasks present in real-life are simply beyond complete automation. On the other hand, there are tasks, which are also impossible to achieve by humans without the aid of ASR.

artificial intelligence, metric, natural language, (13 more...)

1601.02789

Country:

Europe (0.93)
North America > United States (0.28)

Genre: Research Report > Experimental Study (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Speech (0.68)

Firat, Orhan, Cho, Kyunghyun, Bengio, Yoshua

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

arXiv.org Machine LearningJan-5-2016

We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. In particular, we observe that the proposed model significantly improves the translation quality of low-resource language pairs.

machine learning, natural language, translation, (16 more...)

1601.01073

Country: North America > Canada (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Andreas, Jacob, Rabinovich, Maxim, Jordan, Michael I., Klein, Dan

On the Accuracy of Self-Normalized Log-Linear Models

Neural Information Processing SystemsDec-31-2015

Calculation of the log-normalizer is a major computational obstacle in applications of log-linear models with large output spaces. The problem of fast normalizer computation has therefore attracted significant attention in the theoretical and applied machine learning literature. In this paper, we analyze a recently proposed technique known as ``self-normalization'', which introduces a regularization term in training to penalize log normalizers for deviating from zero. This makes it possible to use unnormalized model scores as approximate probabilities. Empirical evidence suggests that self-normalization is extremely effective, but a theoretical understanding of why it should work, and how generally it can be applied, is largely lacking.We prove upper bounds on the loss in accuracy due to self-normalization, describe classes of input distributionsthat self-normalize easily, and construct explicit examples of high-variance input distributions. Our theoretical results make predictions about the difficulty of fitting self-normalized models to several classes of distributions, and we conclude with empirical validation of these predictions on both real and synthetic datasets.

likelihood gap, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Chorowski, Jan K., Bahdanau, Dzmitry, Serdyuk, Dmitriy, Cho, Kyunghyun, Bengio, Yoshua

Attention-Based Models for Speech Recognition

Neural Information Processing SystemsDec-31-2015

Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks including machine translation, handwriting synthesis and image caption generation. We extend the attention-mechanism with features needed for speech recognition. We show that while an adaptation of the model used for machine translation reaches a competitive 18.6\% phoneme error rate (PER) on the TIMIT phoneme recognition task, it can only be applied to utterances which are roughly as long as the ones it was trained on. We offer a qualitative explanation of this failure and propose a novel and generic method of adding location-awareness to the attention mechanism to alleviate this issue. The new method yields a model that is robust to long inputs and achieves 18\% PER in single utterances and 20\% in 10-times longer (repeated) utterances. Finally, we propose a change to the attention mechanism that prevents it from concentrating too much on single frames, which further reduces PER to 17.6\% level.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: Europe > Poland (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.88)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)

Wołk, Krzysztof, Marasek, Krzysztof

Unsupervised comparable corpora preparation and exploration for bi-lingual translation equivalents

arXiv.org Machine LearningDec-5-2015

The multilingual nature of the world makes translation a crucial requirement today. Parallel dictionaries constructed by humans are a widely-available resource, but they are limited and do not provide enough coverage for good quality translation purposes, due to out-of-vocabulary words and neologisms. This motivates the use of statistical translation systems, which are unfortunately dependent on the quantity and quality of training data. Such systems have a very limited availability especially for some languages and very narrow text domains. In this research we present our improvements to current comparable corpora mining methodologies by re- implementation of the comparison algorithms (using Needleman-Wunch algorithm), introduction of a tuning script and computation time improvement by GPU acceleration. Experiments are carried out on bilingual data extracted from the Wikipedia, on various domains. For the Wikipedia itself, additional cross-lingual comparison heuristics were introduced. The modifications made a positive impact on the quality and quantity of mined data and on the translation quality.

artificial intelligence, machine learning, natural language, (19 more...)

1512.01641

Country: Europe (0.68)

Genre: Research Report (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Wołk, Krzysztof, Marasek, Krzysztof

PJAIT Systems for the IWSLT 2015 Evaluation Campaign Enhanced by Comparable Corpora

arXiv.org Machine LearningDec-5-2015

In this paper, we attempt to improve Statistical Machine Translation (SMT) systems on a very diverse set of language pairs (in both directions): Czech - English, Vietnamese - English, French - English and German - English. To accomplish this, we performed translation model training, created adaptations of training settings for each language pair, and obtained comparable corpora for our SMT systems. Innovative tools and data adaptation techniques were employed. The TED parallel text corpora for the IWSLT 2015 evaluation campaign were used to train language models, and to develop, tune, and test the system. In addition, we prepared Wikipedia-based comparable corpora for use with our SMT system. This data was specified as permissible for the IWSLT 2015 evaluation. We explored the use of domain adaptation techniques, symmetrized word alignment models, the unsupervised transliteration models and the KenLM language modeling tool. To evaluate the effects of different preparations on translation results, we conducted experiments and used the BLEU, NIST and TER metrics. Our results indicate that our approach produced a positive impact on SMT quality.

artificial intelligence, corpora, natural language, (14 more...)

1512.01639

Country:

Europe (0.46)
North America > United States (0.28)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Machine LearningNov-24-2015

Natural Language Understanding with Distributed Representation

Cho, Kyunghyun

As the name of the course suggests, this lecture note introduces readers to a neural network based approach to natural language understanding/processing. In order to make it as self-contained as possible, I spend much time on describing basics of machine learning and neural networks, only after which how they are used for natural languages is introduced. On the language front, I almost solely focus on language modelling and machine translation, two of which I personally find most fascinating and most fundamental to natural language understanding. After about a month of lectures and about 40 pages of writing this lecture note, I found this fascinating note [47] by Yoav Goldberg on neural network models for natural language processing. This note deals with wider topics on natural language processing with distributed representations in more details, and I highly recommend you to read it (hopefully along with this lecture note.)

artificial intelligence, machine learning, natural language, (20 more...)

1511.07916

Country:

Europe (1.00)
North America > United States > Texas (0.27)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education (1.00)
Government > Military (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

arXiv.org Artificial IntelligenceNov-20-2015

Empirical Study on Deep Learning Models for Question Answering

Yu, Yang, Zhang, Wei, Hang, Chung-Wei, Xiang, Bing, Zhou, Bowen

In this paper we explore deep learning models with memory component or attention mechanism for question answering task. We combine and compare three models, Neural Machine Translation, Neural Turing Machine, and Memory Networks for a simulated QA data set. This paper is the first one that uses Neural Machine Translation and Neural Turing Machines for solving QA tasks. Our results suggest that the combination of attention and memory have potential to solve certain QA problem.

machine learning, natural language, qa problem, (17 more...)

arXiv.org Artificial Intelligence

1510.07526

Country: North America > United States (0.29)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)