AITopics | Discourse & Dialogue

Understanding Language in Conversations

"The problems addressed in discourse research aim to answer two general kinds of questions: (1) what information is contained in extended sequences of utterances that goes beyond the meaning of the individual utterances themselves? (2) how does the context in which an utterance is used affect the meaning of the individual utterances, or parts of them?"
– Barbara Grosz. Overview of Chapter 6: Discourse and Dialogue, Survey of the State of the Art in Human Language Technology (1996).

Gaussian Hierarchical Latent Dirichlet Allocation: Bringing Polysemy Back

Yoshida, Takahiro, Hisano, Ryohei, Ohnishi, Takaaki

arXiv.org Machine Learning | Feb-25-2020

Topic models are widely used to discover the latent representation of a set of documents. The two canonical models are latent Dirichlet allocation and Gaussian latent Dirichlet allocation, where the former uses multinomial distributions over words and the latter uses multivariate Gaussian distributions over pre-trained word embedding vectors as the latent topic representations. Compared with latent Dirichlet allocation, Gaussian latent Dirichlet allocation is limited in the sense that it does not capture the polysemy of a word such as "bank." In this paper, we show that Gaussian latent Dirichlet allocation could recover the ability to capture polysemy by introducing a hierarchical structure in the set of topics that the model can use to represent a given document. Our Gaussian hierarchical latent Dirichlet allocation significantly improves polysemy detection compared with Gaussian-based models and provides more parsimonious topic representations compared with hierarchical latent Dirichlet allocation. Our extensive quantitative experiments show that our model also achieves better topic coherence and held-out document predictive accuracy over a wide range of corpora and word embedding vectors.
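For reference, the contrast between the two canonical models can be sketched in terms of how a single word is generated once its topic has been drawn. This is the standard textbook formulation rather than anything taken from the paper itself; the symbols $\theta_d$, $\phi_k$, $\mu_k$, $\Sigma_k$, and $v(w)$ (the pre-trained embedding of word $w$) are notation assumed here for illustration.

```latex
% LDA: the word itself is drawn from a topic-specific distribution over the vocabulary
z_{dn} \sim \mathrm{Categorical}(\theta_d), \qquad
w_{dn} \mid z_{dn} = k \sim \mathrm{Categorical}(\phi_k)

% Gaussian LDA: the word's embedding is drawn from a topic-specific Gaussian
z_{dn} \sim \mathrm{Categorical}(\theta_d), \qquad
v(w_{dn}) \mid z_{dn} = k \sim \mathcal{N}(\mu_k, \Sigma_k)
```

One way to read the abstract's remark about polysemy: in LDA a word such as "bank" can have high probability $\phi_k(w)$ under several unrelated topics, whereas in the Gaussian variant the word has a single embedding $v(w)$ and so tends to receive high density only from topics whose Gaussians lie near that one point.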

artificial intelligence, natural language, text processing, (16 more...)

arXiv.org Machine Learning

2002.10855

Country:

North America > United States > New York > New York County > New York City (0.05)
Asia > Middle East > Jordan (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre: Research Report (0.40)

Industry:

Banking & Finance (0.68)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Gated Mechanism for Attention Based Multimodal Sentiment Analysis

Kumar, Ayush, Vepa, Jithendra

arXiv.org Machine Learning | Feb-21-2020

ABSTRACT [...] different granularities [3, 9] or use a cross interaction block that couples the features from different modalities [10, 6]. It is imperative that all modalities in multimodal interactions [...] and 3. Fusion of unimodal and cross [...] Therefore, to learn better cross modal information, we introduce [...] 1.6% and 1.34% absolute improvement over current state-of-the-art. Furthermore, to capture long term dependencies across [...] 1. INTRODUCTION [...] These are categorised into three types: 1. Methods that learn the modalities independently and fuse the [...] In our proposed model, we aim to learn the interaction between [...] [3, 4], and 3. Methods that explicitly learn contributions [...] Multimodal sentiment analysis provides an opportunity to [...] 2.1. [...] $M_{TV} = H_T W H_V^{\top}$; $W \in \mathbb{R}^{d \times d}$ (3) [...] $(U_1, U_2, \ldots, U_u)$ for a Text modality can be defined as: $H_T = \text{Bi-GRU}(U_1, U_2, \ldots, U_u)$ (1) [...] Cross attentive representations of Text ($C_{VT} \in \mathbb{R}^{u \times d}$) and Video ($C_{TV} \in \mathbb{R}^{u \times d}$) can be represented as: [...] Subscript T denotes Text modality, A and V represent Audio [...] As much as there is an opportunity to leverage cross modal interactions, [...] representations is employed.

interaction, modality, sentiment analysis, (14 more...)

arXiv.org Machine Learning

2003.01043

Country: Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.75)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.75)

Semi-Supervised Class Discovery

Nixon, Jeremy, Liu, Jeremiah, Berthelot, David

arXiv.org Machine Learning | Feb-21-2020

One promising approach to dealing with datapoints that are outside of the initial training distribution (OOD) is to create new classes that capture similarities in the datapoints previously rejected as uncategorizable. Systems that generate labels can be deployed against an arbitrary amount of data, discovering classification schemes that through training create a higher quality representation of data. We introduce the Dataset Reconstruction Accuracy, a new and important measure of the effectiveness of a model's ability to create labels. We introduce benchmarks against this Dataset Reconstruction metric. We apply a new heuristic, class learnability, for deciding whether a class is worthy of addition to the training dataset. We show that our class discovery system can be successfully applied to vision and language, and we demonstrate the value of semi-supervised learning in automatically discovering novel classes.

accuracy, dataset, learning, (12 more...)

arXiv.org Machine Learning

2002.0348

Country:

North America > United States > California > Santa Clara County > Mountain View (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.94)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.68)

Analyzing Customer Support on Social Media - Qualetics Data Machines

#artificialintelligence | Feb-19-2020, 12:16:08 GMT

The goal of this study is to analyze the queries customers raise on a particular social media platform, by examining their interactions with customer support, and to draw insights from sentiment analysis. We performed exploratory data analysis to extract insights from the data. With natural language processing tools like NLTK, sentiment analysis was performed to understand the positive, negative, and neutral sentiments of a brand's customers. Machine learning was used to identify the frequency of similar text appearances. Deep learning algorithms were used to understand the customer queries and the average time the respective company's social customer support team took to address them.

analyzing customer support, qualetic data machine, social media, (2 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.58)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.58)

Sentiment Analysis with the bag-of-words

#artificialintelligence | Feb-19-2020, 12:03:44 GMT

As a precursor to research on sentiment analysis with text classifiers (Naive Bayes, Maximum Entropy, SVM), sentiment analysis with a bag-of-words approach was carried out, and positive/negative sentiment was detected with an accuracy of 60%. This is with unigrams only. The percentage is expected to be much higher when bigrams or trigrams are used (to be covered in a later blog post). See the results at: part 1: http://tinyurl.com/gnlfqqm
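A minimal sketch of what unigram bag-of-words sentiment detection can look like is given here. It is not the blog author's code: the two word lists are hypothetical toy lexicons, and the scoring rule (compare counts of positive and negative unigrams) is an assumption chosen for illustration.

```python
from collections import Counter
import re

# Hypothetical toy lexicons; a real study would use a curated sentiment word list.
POSITIVE = {"good", "great", "love", "excellent", "happy"}
NEGATIVE = {"bad", "terrible", "hate", "poor", "sad"}


def tokenize(text):
    """Lowercase the text and split it into unigram tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(text):
    """Return unigram counts; word order is discarded."""
    return Counter(tokenize(text))


def sentiment(text):
    """Classify by comparing counts of positive and negative unigrams."""
    bag = bag_of_words(text)
    pos = sum(bag[w] for w in POSITIVE)
    neg = sum(bag[w] for w in NEGATIVE)
    if pos > neg:
        return "positive"
    if neg > pos:
        return "negative"
    return "neutral"


print(sentiment("I love this phone, the camera is great"))
print(sentiment("not good"))
```

The second call is labelled "positive" because the unigram bag keeps "good" but loses its link to "not". That is the kind of error bigrams or trigrams can reduce, which is consistent with the improvement the post anticipates.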

sentiment analysis, tinyurl

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.73)

Non-Autoregressive Dialog State Tracking

Le, Hung, Socher, Richard, Hoi, Steven C. H.

arXiv.org Artificial Intelligence | Feb-19-2020

Recent efforts in Dialogue State Tracking (DST) for task-oriented dialogues have progressed toward open-vocabulary or generation-based approaches where the models can generate slot value candidates from the dialogue history itself. These approaches have shown good performance gain, especially in complicated dialogue domains with dynamic slot values. However, they fall short in two aspects: (1) they do not allow models to explicitly learn signals across domains and slots to detect potential dependencies among (domain, slot) pairs; and (2) existing models follow auto-regressive approaches which incur high time cost when the dialogue evolves over multiple domains and multiple turns. In this paper, we propose a novel framework of Non-Autoregressive Dialog State Tracking (NADST) which can factor in potential dependencies among domains and slots to optimize the models towards better prediction of dialogue states as a complete set rather than separate slots. In particular, the non-autoregressive nature of our method not only enables decoding in parallel to significantly reduce the latency of DST for real-time dialogue response generation, but also detects dependencies among slots at token level in addition to slot and domain level. Our empirical results show that our model achieves the state-of-the-art joint accuracy across all domains on the MultiWOZ 2.1 corpus, and the latency of our model is an order of magnitude lower than the previous state of the art as the dialogue history extends over time.

dialog state, dialogue history, dialogue state, (13 more...)

arXiv.org Artificial Intelligence

2002.08024

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(8 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Consumer Products & Services (0.46)
Transportation (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Asynchronous Distributed Learning of Topic Models

Smyth, Padhraic, Welling, Max, Asuncion, Arthur U.

Neural Information Processing Systems | Feb-15-2020, 19:43:56 GMT

Distributed learning is a problem of fundamental interest in machine learning and cognitive science. In this paper, we present asynchronous distributed learning algorithms for two well-known unsupervised learning frameworks: Latent Dirichlet Allocation (LDA) and Hierarchical Dirichlet Processes (HDP). In the proposed approach, the data are distributed across P processors, and processors independently perform Gibbs sampling on their local data and communicate their information in a local asynchronous manner with other processors. We demonstrate that our asynchronous algorithms are able to learn global topic models that are statistically as accurate as those learned by the standard LDA and HDP samplers, but with significant improvements in computation time and memory. We show speedup results on a 730-million-word text corpus using 32 processors, and we provide perplexity results for up to 1500 virtual processors.

asynchronous, processor, topic model, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.65)

Spatial Latent Dirichlet Allocation

Wang, Xiaogang, Grimson, Eric

Neural Information Processing Systems | Feb-15-2020, 06:11:10 GMT

In recent years, the language model Latent Dirichlet Allocation (LDA), which clusters co-occurring words into topics, has been widely applied in the computer vision field. However, many of these applications have difficulty with modeling the spatial and temporal structure among visual words, since LDA assumes that a document is a "bag-of-words". It is also critical to properly design "words" and "documents" when using a language model to solve vision problems. In this paper, we propose a topic model, Spatial Latent Dirichlet Allocation (SLDA), which better encodes spatial structure among visual words that are essential for solving many vision problems. The spatial information is not encoded in the value of visual words but in the design of documents.

latent dirichlet allocation, spatial latent dirichlet allocation, visual word, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Supervised Topic Models

McAuliffe, Jon D., Blei, David M.

Neural Information Processing Systems | Feb-15-2020, 04:27:49 GMT

We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. We derive a maximum-likelihood procedure for parameter estimation, which relies on variational approximations to handle intractable posterior expectations. Prediction problems motivate this research: we use the fitted model to predict response values for new documents. We test sLDA on two real-world problems: movie ratings predicted from reviews, and web page popularity predicted from text descriptions. We illustrate the benefits of sLDA versus modern regularized regression, as well as versus an unsupervised LDA analysis followed by a separate regression.
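The response model behind this abstract can be sketched as follows, using the commonly cited form of sLDA for a real-valued response. The symbols are not in the abstract and are assumed here: $z_n$ is the topic indicator vector of the $n$-th word, $\eta$ and $\sigma^2$ are regression parameters, and $q$ is the variational posterior.

```latex
% Empirical topic frequencies of a document with N words
\bar{z} = \frac{1}{N} \sum_{n=1}^{N} z_n

% The response is regressed on those frequencies
y \mid z_{1:N}, \eta, \sigma^2 \sim \mathcal{N}\!\left(\eta^{\top} \bar{z},\ \sigma^2\right)

% Prediction for a new document uses the variational expectation
\mathbb{E}\left[\, y \mid w_{1:N} \,\right] \approx \eta^{\top}\, \mathbb{E}_q\!\left[\bar{z}\right]
```

Because the response depends on the same topic assignments that generate the words, topics are fitted jointly with the regression. This is the contrast with the two-stage baseline the abstract mentions, where an unsupervised LDA fit is followed by a separate regression.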

regression, slda, supervised topic model

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.78)

Parallel Inference for Latent Dirichlet Allocation on Graphics Processing Units

Yan, Feng, Xu, Ningyi, Qi, Yuan

Neural Information Processing Systems | Feb-15-2020, 04:11:52 GMT

The recent emergence of Graphics Processing Units (GPUs) as general-purpose parallel computing devices provides us with new opportunities to develop scalable learning methods for massive data. In this work, we consider the problem of parallelizing two inference methods on GPUs for latent Dirichlet allocation (LDA) models: collapsed Gibbs sampling (CGS) and collapsed variational Bayesian (CVB). To address limited memory constraints on GPUs, we propose a novel data partitioning scheme that effectively reduces the memory cost. Furthermore, the partitioning scheme balances the computational cost on each multiprocessor and enables us to easily avoid memory access conflicts. We also use data streaming to handle extremely large datasets. Extensive experiments showed that our parallel inference methods consistently produced LDA models with the same predictive power as sequential training methods, but with a 26x speedup for CGS and a 196x speedup for CVB on a GPU with 30 multiprocessors; in fact, the speedup scales almost linearly with the number of multiprocessors available.
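For orientation, a minimal sketch of the sequential collapsed Gibbs sampler for LDA is given here. It represents the kind of sequential baseline the abstract compares against, not the paper's GPU partitioning scheme. The function name, hyperparameter defaults, and toy corpus are assumptions made for illustration.

```python
import random


def collapsed_gibbs_lda(docs, n_topics, vocab_size, alpha=0.1, beta=0.01,
                        n_iters=50, seed=0):
    """Sequential collapsed Gibbs sampling for LDA.

    docs: list of documents, each a list of integer word ids in [0, vocab_size).
    Returns (document-topic counts, topic-word counts, topic assignments).
    """
    rng = random.Random(seed)
    n_dk = [[0] * n_topics for _ in docs]               # document-topic counts
    n_kw = [[0] * vocab_size for _ in range(n_topics)]  # topic-word counts
    n_k = [0] * n_topics                                # tokens per topic
    z = []                                              # topic of every token

    # Random initialisation of topic assignments.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
        z.append(z_d)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from all counts.
                k = z[d][i]
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                # Unnormalised conditional p(z = t | all other assignments).
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][w] + beta)
                    / (n_k[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                # Add the token back under its newly sampled topic.
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1
    return n_dk, n_kw, z


# Toy corpus with six word ids; documents 0 and 2 share words, as do 1 and 3.
docs = [[0, 1, 0, 1, 2], [3, 4, 3, 4, 5], [0, 2, 1, 0], [5, 3, 4, 4]]
n_dk, n_kw, z = collapsed_gibbs_lda(docs, n_topics=2, vocab_size=6)
print(n_dk)
```

Every token update reads and writes the shared `n_kw` and `n_k` tables, so running the inner loop concurrently over documents would have different workers touching the same counters. This is presumably the memory access conflict that the abstract's partitioning scheme is designed to avoid.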

dirichlet allocation, graphic processing unit, latent dirichlet allocation, (3 more...)

Neural Information Processing Systems

Genre: Research Report (0.44)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.65)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.65)