AITopics

Twenty-Third International Joint Conference on Artificial Intelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.60)

AAAI Conferences, Jul-9-2013

A Topic-Based Coherence Model for Statistical Machine Translation

Xiong, Deyi (Soochow University) | Zhang, Min (Soochow University)

Coherence that ties sentences of a text into a meaningfully connected structure is of great importance to text generation and translation. In this paper, we propose a topic-based coherence model to produce coherence for document translation, in terms of the continuity of sentence topics in a text. We automatically extract a coherence chain for each source text to be translated. Based on the extracted source coherence chain, we adopt a maximum entropy classifier to predict the target coherence chain that defines a linear topic structure for the target document. The proposed topic-based coherence model then uses the predicted target coherence chain to help the decoder select coherent word/phrase translations. Our experiments show that incorporating the topic-based coherence model into machine translation achieves substantial improvement over both the baseline and previous methods that integrate document topics rather than coherence chains into machine translation.

artificial intelligence, natural language, statistical machine translation, (1 more...)

Twenty-Seventh AAAI Conference on Artificial Intelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.80)

Artificial Intelligence on Mobile Devices: An Introduction to the Special Issue

Yang, Qiang (Huawei Noah’s Ark Lab) | Zhao, Feng (Microsoft Research Asia)

AI Magazine, Jul-5-2013

We will see more and more applications of AI on mobile devices. This special issue of AI Magazine is devoted to some exemplary works of AI on mobile devices. We include four works that range from mobile activity recognition and air-quality detection to machine translation and image compression. These works were chosen from a variety of sources, including the International Joint Conference on Artificial Intelligence 2011 Special Track on Integrated and Embedded AI Systems, held in Barcelona, Spain, in July 2011.

artificial intelligence, mobile device, natural language, (15 more...)

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.25)
North America > United States > Maryland (0.15)

Genre: Collection > Journal > Special Issue (0.69)

Industry: Information Technology (0.49)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.37)

Speaking Louder than Words with Pictures Across Languages

Finch, Andrew (NICT) | Song, Wei (Canon Inc.) | Tanaka-Ishii, Kumiko (Kyushu University) | Sumita, Eiichiro (NICT)

AI Magazine, Jul-5-2013

In this article, we investigate the possibility of cross-language communication using a synergy of words and pictures on mobile devices. Communicating with only pictures is in itself a very powerful strategy, but is limited in expressiveness. On the other hand, words can express everything you could wish to say, but they are cumbersome to work with on mobile devices, and need to be translated in order for their meaning to be understood. Automatic translations can contain errors that pervert the communication process, and this may undermine the users’ confidence when expressing themselves across language barriers. Our idea is to create a user interface for cross-language communication that uses pictures as the primary mode of input, and words to express the detailed meaning. This interface creates a visual process of communication that occurs on two heterogeneous channels that can support each other. We implemented this user interface as an application on the Apple iPad tablet, and performed a set of experiments to determine its usefulness as a translation aid for travellers.

artificial intelligence, natural language, sequence, (17 more...)

Country: Asia > Japan > Honshū (0.14)

Industry: Education (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Krug, Wayne (Language Computer Corporation) | Tomlinson, Marc T. (Language Computer Corporation)

Automated Non-Content Word List Generation Using hLDA

AAAI Conferences, May-19-2013

In this paper, we present a language-independent method for the automatic, unsupervised extraction of non-content words from a corpus of documents. This method permits the creation of word lists that may be used in place of traditional function word lists in various natural language processing tasks. As an example we generated lists of words from a corpus of English, Chinese, and Russian posts extracted from Wikipedia articles and Wikipedia Wikitalk discussion pages. We applied these lists to the task of authorship attribution on this corpus to compare the effectiveness of lists of words extracted with this method to expert-created function word lists and frequent word lists (a common alternative to function word lists). hLDA lists perform comparably to frequent word lists. The trials also show that corpus-derived lists tend to perform better than more generic lists, and both sets of generated lists significantly outperformed the expert lists. Additionally, we evaluated the performance of an English expert list on machine translations of our Chinese and Russian documents, showing that our method also outperforms this alternative.
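The frequent word list baseline mentioned in this abstract can be illustrated with a minimal sketch. It does not reproduce the hLDA method itself. The function names, the regex tokenizer, and the toy posts are assumptions made for illustration; in particular, whitespace-free languages such as Chinese would need a proper word segmenter first.

```python
from collections import Counter
import re

def frequent_word_list(documents, size=5):
    """Return the `size` most frequent lowercase word tokens across documents."""
    counts = Counter()
    for text in documents:
        # Assumed tokenizer: runs of word characters, lowercased.
        counts.update(re.findall(r"\w+", text.lower()))
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:size]]

def relative_frequencies(text, word_list):
    """Feature vector: share of the tokens in `text` taken by each listed word."""
    tokens = re.findall(r"\w+", text.lower())
    total = len(tokens) or 1  # avoid division by zero on empty text
    counts = Counter(tokens)
    return [counts[word] / total for word in word_list]

# Toy corpus, invented for this example.
posts = ["The cat sat on the mat.", "The dog and the cat ran."]
word_list = frequent_word_list(posts, size=2)
features = relative_frequencies(posts[0], word_list)
```

Feature vectors of this kind could then be passed to any standard classifier for authorship attribution; the abstract does not say which classifier the authors used.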

automated non-content word list generation, hlda

The Twenty-Sixth International FLAIRS Conference

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.60)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.53)

Applying Automated Language Translation at a Global Enterprise Level

Rychtyckyj, Nestor (Ford Motor Company) | Plesco, Craig (Ford Motor Company)

AI Magazine, Apr-3-2013

In 2007 we presented a paper that described the application of Natural Language Processing (NLP) and Machine Translation (MT) for the automated translation of process build instructions from English to other languages to support Ford's assembly plants in non-English speaking countries. This project has continued to evolve with the addition of new languages and improvements to the translation process. However, we discovered that there was a large demand for automated language translation across all of Ford Motor Company and we decided to expand the scope of our project to address these requirements. This paper will describe our efforts to meet all of Ford's internal translation requirements with AI and MT technology and focus on the challenges and lessons that we learned from applying advanced technology across an entire corporation.

artificial intelligence, machine translation, natural language, (6 more...)

Industry: Automobiles & Trucks > Manufacturer (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Country:

Europe (0.93)
North America > United States > Michigan (0.28)

Federmann, Christian (German Research Center for Artificial Intelligence)

Multi-Engine Machine Translation as a Lifelong Machine Learning Problem

AAAI Conferences, Mar-21-2013

We describe an approach for multi-engine machine translation that uses machine learning methods to train one or several classifiers for a given set of candidate translations. Contrary to existing approaches in quality estimation which only consider a single translation at a time, we explicitly model pairwise comparison with our feature vectors. We discuss several challenges our method is facing and discuss how lifelong machine learning could be applied to resolve these. We also show how the proposed architecture can be extended to allow human feedback to be included into the training process, improving the system's selection process over time.
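One way to turn pairwise comparisons into a single selection is to count how many comparisons each candidate wins. The sketch below assumes this win-counting scheme; the abstract does not specify how the classifier outputs are combined. `prefers` is a hypothetical stand-in for a trained classifier, and `toy_prefers` is a placeholder rule used only to make the example runnable.

```python
from itertools import permutations

def select_translation(candidates, prefers):
    """Pick the candidate that wins the most pairwise comparisons.

    `prefers(a, b)` is a hypothetical stand-in for a trained classifier;
    it returns True when translation `a` is judged better than `b`.
    """
    wins = {index: 0 for index in range(len(candidates))}
    for i, j in permutations(range(len(candidates)), 2):
        if prefers(candidates[i], candidates[j]):
            wins[i] += 1
    # max() returns the first index with the top score, so ties favor earlier engines.
    best = max(wins, key=wins.get)
    return candidates[best]

def toy_prefers(a, b):
    # Toy rule for illustration only: prefer the longer output.
    return len(a) > len(b)

# Invented outputs from three hypothetical engines.
outputs = ["a house", "the small house", "small house"]
chosen = select_translation(outputs, toy_prefers)
```

In a real system, `prefers` would be computed from feature vectors of the two candidates, and human feedback could supply additional labeled pairs for retraining.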

artificial intelligence, lifelong machine learning problem, natural language, (1 more...)

2013 AAAI Spring Symposium Series

Industry: Education > Focused Education > Special Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.60)

Costa-jussà, M. R., Henríquez, C. A., Banchs, R. E.

Evaluating Indirect Strategies for Chinese-Spanish Statistical Machine Translation

Journal of Artificial Intelligence Research, Dec-31-2012

Although Chinese and Spanish are two of the most spoken languages in the world, not much research has been done in machine translation for this language pair. This paper investigates the state of the art in Chinese-to-Spanish statistical machine translation (SMT), which is currently one of the most popular approaches to machine translation. For this purpose, we report details of the available parallel corpora: the Basic Traveller Expressions Corpus (BTEC), the Holy Bible and the United Nations (UN) corpus. Additionally, we conduct experimental work with the largest of these three corpora to explore alternative SMT strategies that use a pivot language. Three alternatives are considered for pivoting: cascading, pseudo-corpus and triangulation. As the pivot language, we use either English, Arabic or French. Results show that, for a phrase-based SMT system, English is the best pivot language between Chinese and Spanish. We propose a system output combination using the pivot strategies which is capable of outperforming the direct translation strategy. The main objective of this work is to motivate and involve the research community in work on this important language pair, given its demographic impact.
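Two of the pivot strategies named above can be sketched briefly. Triangulation is shown in a commonly used form, where source-to-pivot and pivot-to-target phrase tables are combined by summing over shared pivot phrases; cascading simply chains two translation systems. The abstract does not give the paper's exact formulation, so the table layout, function names, and probabilities below are assumptions, and the pseudo-corpus strategy is not shown.

```python
from collections import defaultdict

def triangulate(src_to_pivot, pivot_to_tgt):
    """Combine two phrase tables by marginalizing over shared pivot phrases.

    Each table maps a phrase to {translation: probability}. The result
    approximates p(tgt | src) = sum over pivot of p(pivot | src) * p(tgt | pivot).
    """
    src_to_tgt = {}
    for src, pivots in src_to_pivot.items():
        scores = defaultdict(float)
        for pivot, p_pivot in pivots.items():
            # Pivot phrases missing from the second table contribute nothing.
            for tgt, p_tgt in pivot_to_tgt.get(pivot, {}).items():
                scores[tgt] += p_pivot * p_tgt
        if scores:
            src_to_tgt[src] = dict(scores)
    return src_to_tgt

def cascade(sentence, translate_src_to_pivot, translate_pivot_to_tgt):
    """Cascading: run two translation systems back to back."""
    return translate_pivot_to_tgt(translate_src_to_pivot(sentence))

# Toy phrase tables with invented probabilities (Chinese -> English -> Spanish).
zh_en = {"房子": {"house": 0.75, "home": 0.25}}
en_es = {"house": {"casa": 1.0}, "home": {"casa": 0.5, "hogar": 0.5}}
zh_es = triangulate(zh_en, en_es)
```

With these toy tables, "casa" receives 0.75 × 1.0 + 0.25 × 0.5 = 0.875 and "hogar" receives 0.25 × 0.5 = 0.125.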

machine translation, pivot language, translation, (14 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3786

AI Access Foundation

10794

Country:

North America > United States > California > Los Angeles County > El Segundo (0.04)
Europe > Czechia > Prague (0.04)
Asia > Singapore (0.04)
(11 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.68)

Industry: Government > Intergovernmental Programs (0.35)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Ramírez, Jessica C., Matsumoto, Yuji

A Rule-Based Approach For Aligning Japanese-Spanish Sentences From A Comparable Corpora

arXiv.org Artificial Intelligence, Nov-19-2012

The performance of a Statistical Machine Translation (SMT) system is directly proportional to the quality and size of the parallel corpus it uses. However, for some language pairs there is a considerable lack of such corpora. Our long-term goal is to construct a Japanese-Spanish parallel corpus to be used for SMT, since useful Japanese-Spanish parallel corpora are lacking. To address this problem, in this study we propose a method for extracting Japanese-Spanish parallel sentences from Wikipedia using POS tagging and a rule-based approach. The main focus of this approach is the syntactic features of both languages. Human evaluation was performed over a sample and shows promising results in comparison with the baseline.

artificial intelligence, machine translation, natural language, (15 more...)

arXiv.org Artificial Intelligence

1211.4488

Country:

North America > United States (0.14)
Asia > Taiwan (0.14)
Asia > Japan (0.14)
Asia > India (0.14)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.90)