AITopics

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Machine LearningJun-29-2018

Title Generation for Web Tables

Hancock, Braden, Lee, Hongrae, Yu, Cong

Descriptive titles provide crucial context for interpreting tables that are extracted from web pages and are a key component of table-based web applications. Prior approaches have attempted to produce titles by selecting existing text snippets associated with the table. These approaches, however, are limited by their dependence on suitable titles existing a priori. In our user study, we observe that the relevant information for the title tends to be scattered across the page, and often---more than 80% of time---does not appear verbatim anywhere in the page. We propose instead the application of a sequence-to-sequence neural network model as a more generalizable means of generating high-quality titles. This is accomplished by extracting many text snippets that have potentially relevant information to the table, encoding them into an input sequence, and using both copy and generation mechanisms in the decoder to balance relevance and readability of the generated title. We validate this approach with human evaluation on sample web tables and report that while sequence models with only a copy mechanism or only a generation mechanism are easily outperformed by simple selection-based baselines, the model with both capabilities outperforms them all, approaching the quality of crowdsourced titles while training on fewer than ten thousand examples. To the best of our knowledge, the proposed technique is the first to consider text-generation methods for table titles, and establishes a new state of the art.

information, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1807.00099

Country:

Europe > United Kingdom (0.14)
Europe > Germany (0.04)
Asia > Japan (0.04)
(12 more...)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Sports > Baseball (0.68)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)

Soru, Tommaso, Marx, Edgard, Valdestilhas, André, Esteves, Diego, Moussallem, Diego, Publio, Gustavo

Neural Machine Translation for Query Construction and Composition

arXiv.org Artificial IntelligenceJun-27-2018

Research on question answering with knowledge base has recently seen an increasing use of deep architectures. In this extended abstract, we study the application of the neural machine translation paradigm for question parsing. We employ a sequence-to-sequence model to learn graph patterns in the SPARQL graph query language and their compositions. Instead of inducing the programs through question-answer pairs, we expect a semi-supervised approach, where alignments between questions and queries are built through templates. We argue that the coverage of language utterances can be expanded using late notable works in natural language generation.

machine learning, natural language, neural machine translation, (14 more...)

1806.10478

Country:

Europe > Germany > Saxony > Leipzig (0.06)
Europe > Sweden > Stockholm > Stockholm (0.05)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Bonn (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Ebrahimi, Javid, Lowd, Daniel, Dou, Dejing

On Adversarial Examples for Character-Level Neural Machine Translation

arXiv.org Artificial IntelligenceJun-23-2018

Evaluating on adversarial examples has become a standard procedure to measure robustness of deep learning models. Due to the difficulty of creating white-box adversarial examples for discrete text input, most analyses of the robustness of NLP models have been done through black-box adversarial examples. We investigate adversarial examples for character-level neural machine translation (NMT), and contrast black-box adversaries with a novel white-box adversary, which employs differentiable string-edit operations to rank adversarial changes. We propose two novel types of attacks which aim to remove or change a word in a translation, rather than simply break the NMT. We demonstrate that white-box adversarial examples are significantly stronger than their black-box counterparts in different attack scenarios, which show more serious vulnerabilities than previously known. In addition, after performing adversarial training, which takes only 3 times longer than regular training, we can improve the model's robustness significantly.

adversarial example, machine learning, natural language, (20 more...)

1806.0903

Country:

Europe > France (0.04)
North America > United States > Oregon (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (0.66)
Government > Military (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

#artificialintelligenceJun-21-2018, 21:07:26 GMT

IFlytek, CIPG Will Build National AI Translator to Meet Rising Demand

China's top voice recognition firm iFlytek has penned a deal with China International Publishing Group to build a national artificial intelligence translator and keep up with rising demand. AI translations can lift the burden off human translators, who can barely keep up with requirements at government departments and companies looking to operate overseas, state-owned news agency Xinhua cited CIPG Deputy Director Fang Zhenghui as saying. The machine can translate Chinese into 33 languages, added Liu Qingfeng, president of Anhui-based iFlytek, saying it uses cutting-edge technology to improve the accuracy of machine translations. "When translation machines fail to recognize some special nouns or specific terms, human translators can monitor the process and help to polish the text," he said. "The machine [can] learn from these mistakes and improve its work next time."

human translator, machine translation, natural language, (3 more...)

Country: Asia > China > Anhui Province (0.30)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.65)

#artificialintelligenceJun-21-2018, 12:21:07 GMT

Salesforce research

Deep learning has significantly improved state-of-the-art performance for natural language processing tasks like machine translation, summarization, question answering, and text classification. Each of these tasks is typically studied with a specific metric, and performance is often measured on a set of standard benchmark datasets. This has led to the development of architectures designed specifically for those tasks and metrics, but it does not necessarily promote the emergence of general NLP models, those which can perform well across a wide variety of NLP tasks. In order to explore the possibility of such models as well as the tradeoffs that arise in optimizing for them, we introduce the Natural Language Decathlon (decaNLP). The goal of the Decathlon is to explore models that generalize to all ten tasks and investigate how such models differ from those trained for single tasks.

machine learning, natural language, text classification, (19 more...)

Industry: Information Technology > Software (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.38)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)
(3 more...)

McCann, Bryan, Keskar, Nitish Shirish, Xiong, Caiming, Socher, Richard

The Natural Language Decathlon: Multitask Learning as Question Answering

arXiv.org Artificial IntelligenceJun-20-2018

Deep learning has improved performance on many natural language processing (NLP) tasks individually. However, general NLP models cannot emerge within a paradigm that focuses on the particularities of a single metric, dataset, and task. We introduce the Natural Language Decathlon (decaNLP), a challenge that spans ten tasks: question answering, machine translation, summarization, natural language inference, sentiment analysis, semantic role labeling, zero-shot relation extraction, goal-oriented dialogue, semantic parsing, and commonsense pronoun resolution. We cast all tasks as question answering over a context. Furthermore, we present a new Multitask Question Answering Network (MQAN) jointly learns all tasks in decaNLP without any task-specific modules or parameters in the multitask setting. MQAN shows improvements in transfer learning for machine translation and named entity recognition, domain adaptation for sentiment analysis and natural language inference, and zero-shot capabilities for text classification. We demonstrate that the MQAN's multi-pointer-generator decoder is key to this success and performance further improves with an anti-curriculum training strategy. Though designed for decaNLP, MQAN also achieves state of the art results on the WikiSQL semantic parsing task in the single-task setting. We also release code for procuring and processing data, training and evaluating models, and reproducing all experiments for decaNLP.

machine learning, natural language, question answering, (16 more...)

1806.0873

Country:

North America > United States > California (0.14)
Oceania > Australia > South Australia (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)

Genre: Research Report (1.00)

Industry:

Media > Film (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(2 more...)

McCann, Bryan, Bradbury, James, Xiong, Caiming, Socher, Richard

Learned in Translation: Contextualized Word Vectors

arXiv.org Artificial IntelligenceJun-20-2018

Computer vision has benefited from initializing multiple deep layers with weights pretrained on large supervised training sets like ImageNet. Natural language processing (NLP) typically sees initialization of only the lowest layer of deep models with pretrained word vectors. In this paper, we use a deep LSTM encoder from an attentional sequence-to-sequence model trained for machine translation (MT) to contextualize word vectors. We show that adding these context vectors (CoVe) improves performance over using only unsupervised word and character vectors on a wide variety of common NLP tasks: sentiment analysis (SST, IMDb), question classification (TREC), entailment (SNLI), and question answering (SQuAD). For fine-grained sentiment analysis and entailment, CoVe improves performance of our baseline models to the state of the art.

artificial intelligence, machine learning, natural language, (18 more...)

1708.00107

Country:

North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceJun-19-2018, 22:40:53 GMT

AI Weekly: Google's research center in Ghana won't be the last AI lab in Africa

This year, we have seen an acceleration of Silicon Valley tech giants opening AI research labs around the world as they seek to gain traction among researchers and fulfill their global ambitions. In the past six months or so, Google brought labs to China and France, Facebook opened labs in Pittsburgh and Seattle, and Microsoft announced plans to open labs near universities in Berkeley, California and Melbourne, Australia. This trend shows no signs of slowing down. Last month, Samsung announced labs in Cambridge, Moscow, and Toronto. This week, Nvidia announced plans to open a new lab in Toronto, while Google shared plans to open a lab in Accra, Ghana, Google's first in Africa and perhaps the first of any tech giant in Africa.

artificial intelligence, machine learning, natural language, (18 more...)

Country:

North America > Canada > Ontario > Toronto (0.46)
Africa > Ghana > Greater Accra > Accra (0.27)
Oceania > Australia > Victoria > Melbourne (0.25)
(11 more...)

Genre: Press Release (0.56)

Industry:

Information Technology (1.00)
Government > Regional Government (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.74)
Information Technology > Artificial Intelligence > Robots (0.73)
Information Technology > Communications > Social Media (0.59)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.49)

The Independent - TechJun-19-2018, 15:55:07 GMT

Google Translate: How does the search giant's multilingual interpreter actually work?

Google Translate has become the internet's go-to resource for short, quick translations from foreign languages. The service was first launched in April 2006, seeing off early competition from the likes of Babel Fish. It now boasts more than 500m users daily worldwide, offering 103 languages. But how exactly does it work? How does Google News actually work?

artificial intelligence, natural language, robot, (15 more...)

The Independent - Tech

AI-Alerts: 2018 > 2018-06 > AAAI AI-Alert for Jun 26, 2018 (1.00)

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.22)
North America > United States > California > Los Angeles County > Los Angeles (0.16)
Asia > South Korea > Seoul > Seoul (0.07)
(11 more...)

Industry:

Automobiles & Trucks > Manufacturer (1.00)
Information Technology > Robotics & Automation (0.99)
Transportation > Ground > Road (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.72)