AITopics | Nallapati, Ramesh

Plotting

Nallapati, Ramesh

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Improving Factual Consistency of Abstractive Summarization via Question Answering

Nan, Feng, Santos, Cicero Nogueira dos, Zhu, Henghui, Ng, Patrick, McKeown, Kathleen, Nallapati, Ramesh, Zhang, Dejiao, Wang, Zhiguo, Arnold, Andrew O., Xiang, Bing

arXiv.org Artificial IntelligenceMay-10-2021

A commonly observed problem with the state-of-the art abstractive summarization models is that the generated summaries can be factually inconsistent with the input documents. The fact that automatic summarization may produce plausible-sounding yet inaccurate summaries is a major concern that limits its wide application. In this paper we present an approach to address factual consistency in summarization. We first propose an efficient automatic evaluation metric to measure factual consistency; next, we propose a novel learning algorithm that maximizes the proposed metric during model training. Through extensive experiments, we confirm that our method is effective in improving factual consistency and even overall quality of the summaries, as judged by both automatic metrics and human evaluation.

air transportation, input document, law enforcement, (19 more...)

arXiv.org Artificial Intelligence

2105.04623

Country:

Europe > United Kingdom (0.28)
Asia > Middle East > UAE (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Air (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
Leisure & Entertainment > Sports > Boxing (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)

Add feedback

Transductive Learning for Abstractive News Summarization

Bražinskas, Arthur, Liu, Mengwen, Nallapati, Ramesh, Ravi, Sujith, Dreyer, Markus

arXiv.org Artificial IntelligenceApr-17-2021

Pre-trained language models have recently advanced abstractive summarization. These models are further fine-tuned on human-written references before summary generation in test time. In this work, we propose the first application of transductive learning to summarization. In this paradigm, a model can learn from the test set's input before inference. To perform transduction, we propose to utilize input document summarizing sentences to construct references for learning in test time. These sentences are often compressed and fused to form abstractive summaries and provide omitted details and additional context to the reader. We show that our approach yields state-of-the-art results on CNN/DM and NYT datasets. For instance, we achieve over 1 ROUGE-L point improvement on CNN/DM. Further, we show the benefits of transduction from older to more recent news. Finally, through human and automatic evaluation, we show that our summaries become more abstractive and coherent.

artificial intelligence, summarization, text processing, (17 more...)

arXiv.org Artificial Intelligence

2104.095

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry: Energy > Power Industry > Utilities (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Entity-level Factual Consistency of Abstractive Text Summarization

Nan, Feng, Nallapati, Ramesh, Wang, Zhiguo, Santos, Cicero Nogueira dos, Zhu, Henghui, Zhang, Dejiao, McKeown, Kathleen, Xiang, Bing

arXiv.org Artificial IntelligenceFeb-17-2021

A key challenge for abstractive summarization is ensuring factual consistency of the generated summary with respect to the original document. For example, state-of-the-art models trained on existing datasets exhibit entity hallucination, generating names of entities that are not present in the source document. We propose a set of new metrics to quantify the entity-level factual consistency of generated summaries and we show that the entity hallucination problem can be alleviated by simply filtering the training data. In addition, we propose a summary-worthy entity classification task to the training process as well as a joint entity and summary generation approach, which yield further improvements in entity level metrics.

ground truth summary, machine translation, neural network, (15 more...)

arXiv.org Artificial Intelligence

2102.0913

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Multi-passage BERT: A Globally Normalized BERT Model for Open-domain Question Answering

Wang, Zhiguo, Ng, Patrick, Ma, Xiaofei, Nallapati, Ramesh, Xiang, Bing

arXiv.org Artificial IntelligenceAug-21-2019

BERT model has been successfully applied to open-domain QA tasks. However, previous work trains BERT by viewing passages corresponding to the same question as independent training instances, which may cause incomparable scores for answers from different passages. To tackle this issue, we propose a multi-passage BERT model to globally normalize answer scores across all passages of the same question, and this change enables our QA model find better answers by utilizing more passages. In addition, we find that splitting articles into passages with the length of 100 words by sliding window improves performance by 4%. By leveraging a passage ranker to select high-quality passages, multi-passage BERT gains additional 2%. Experiments on four standard benchmarks showed that our multi-passage BERT outperforms all state-of-the-art models on all benchmarks.

inductive learning, multi-passage bert, neural network, (19 more...)

arXiv.org Artificial Intelligence

1908.08167

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.55)

Add feedback

Topic Modeling with Wasserstein Autoencoders

Nan, Feng, Ding, Ran, Nallapati, Ramesh, Xiang, Bing

arXiv.org Artificial IntelligenceJul-24-2019

We propose a novel neural topic model in the Wasserstein autoencoders (WAE) framework. Unlike existing variational autoencoder based models, we directly enforce Dirichlet prior on the latent document-topic vectors. We exploit the structure of the latent space and apply a suitable kernel in minimizing the Maximum Mean Discrepancy (MMD) to perform distribution matching. We discover that MMD performs much better than the Generative Adversarial Network (GAN) in matching high dimensional Dirichlet distribution. We further discover that incorporating randomness in the encoder output during training leads to significantly more coherent topics. To measure the diversity of the produced topics, we propose a simple topic uniqueness metric. Together with the widely used coherence measure NPMI, we offer a more wholistic evaluation of topic quality. Experiments on several real datasets show that our model produces significantly better topics than existing topic models.

government & the courts, law enforcement, w-lda, (39 more...)

arXiv.org Artificial Intelligence

1907.12374

Country:

Oceania (1.00)
North America > Canada (1.00)
Europe > United Kingdom (1.00)
(9 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Ground (1.00)
Transportation > Air (1.00)
Media > Television (1.00)
(32 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

SenGen: Sentence Generating Neural Variational Topic Model

Nallapati, Ramesh, Melnyk, Igor, Kumar, Abhishek, Zhou, Bowen

arXiv.org Machine LearningAug-1-2017

We present a new topic model that generates documents by sampling a topic for one whole sentence at a time, and generating the words in the sentence using an RNN decoder that is conditioned on the topic of the sentence. We argue that this novel formalism will help us not only visualize and model the topical discourse structure in a document better, but also potentially lead to more interpretable topics since we can now illustrate topics by sampling representative sentences instead of bag of words or phrases. We present a variational auto-encoder approach for learning in which we use a factorized variational encoder that independently models the posterior over topical mixture vectors of documents using a feed-forward network, and the posterior over topic assignments to sentences using an RNN. Our preliminary experiments on two different datasets indicate early promise, but also expose many challenges that remain to be addressed.

artificial intelligence, neural network, posterior, (16 more...)

arXiv.org Machine Learning

1708.00308

Country: North America > United States (0.28)

Genre: Research Report (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents

Nallapati, Ramesh (IBM Watson) | Zhai, Feifei (IBM Watson) | Zhou, Bowen (IBM Watson)

AAAI ConferencesFeb-14-2017

We present SummaRuNNer, a Recurrent Neural Network (RNN) based sequence model for extractive summarization of documents and show that it achieves performance better than or comparable to state-of-the-art. Our model has the additional advantage of being very interpretable, since it allows visualization of its predictions broken up by abstract features such as information content, salience and novelty. Another novel contribution of our work is abstractive training of our extractive model that can train on human generated reference summaries alone, eliminating the need for sentence-level extractive labels.

deep learning, neural network, summarization, (20 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: South America (0.14)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback