Discourse & Dialogue
The Role of Knowledge and Certainty in Understanding for Dialogue
Epstein, Susan L. (Hunter College and The Graduate Center of The City University of New York) | Passonneau, Rebecca (Columbia University) | Gordon, Joshua (Columbia University) | Ligorio, Tiziana (The Graduate School of The City University of New York)
As people engage in increasingly complex conversations with computers, the need for generality and flexibility in spoken dialogue systems becomes more apparent. This paper describes how three different spoken dialogue systems for the same task reason with knowledge and certainty as they seek to understand what people want. It advocates systems that exploit partial understanding, consider credibility, and are aware both of what they know and of their certainty that it matches their users’ intent.
Kernel Topic Models
Hennig, Philipp, Stern, David, Herbrich, Ralf, Graepel, Thore
We study a variation of this concept, in which the documents' mixture weight beliefs are replaced with squashed Gaussian distributions. This allows documents to be associated with elements of a Hilbert space, admitting kernel topic models (KTM), modelling temporal, spatial, hierarchical, social and other structure between documents. The main challenge is efficient approximate inference on the latent Gaussian. We present an approximate algorithm cast around a Laplace approximation in a transformed basis. The KTM can also be interpreted as a type of Gaussian process latent variable model, or as a topic model conditional on document features, uncovering links between earlier work in these areas.
Higher-Order Markov Tag-Topic Models for Tagged Documents and Images
Zeng, Jia, Feng, Wei, Cheung, William K., Li, Chun-Hung
This paper studies the topic modeling problem of tagged documents and images. Higher-order relations among tagged documents and images are major and ubiquitous characteristics, and play positive roles in extracting reliable and interpretable topics. In this paper, we propose the tag-topic models (TTM) to depict such higher-order topic structural dependencies within the Markov random field (MRF) framework. First, we use the novel factor graph representation of latent Dirichlet allocation (LDA)-based topic models from the MRF perspective, and present an efficient loopy belief propagation (BP) algorithm for approximate inference and parameter estimation. Second, we propose the factor hypergraph representation of TTM, and focus on both pairwise and higher-order relation modeling among tagged documents and images. Efficient loopy BP algorithm is developed to learn TTM, which encourages the topic labeling smoothness among tagged documents and images. Extensive experimental results confirm the incorporation of higher-order relations to be effective in enhancing the overall topic modeling performance, when compared with current state-of-the-art topic models, in many text and image mining tasks of broad interests such as word and link prediction, document classification, and tag recommendation.
A Comparison between Microblog Corpus and Balanced Corpus from Linguistic and Sentimental Perspectives
Tang, Yi-jie (National Taiwan University) | Li, Chang-Ye (National Taiwan University) | Chen, Hsin-Hsi (National Taiwan University)
While microblogging has gained popularity on the Internet, analyzing and processing short messages has become a challenging task in natural language processing. This paper analyzes the differences between Internet short messages (or “microtext”) and general articles by comparing the Plurk Corpus and the Sinica Balanced Corpus. Likelihood ratio and the tóngyìcícílín thesaurus are adopted to analyze the lexical semantics of frequent terms in each corpus. Furthermore, the NTUSD sentiment dictionary is used to compare the sentiment distribution of the two corpora. The result is also applied to sentiment transition analysis.
Domain Adaptation in Sentiment Analysis of Twitter
Peddinti, Viswa Mani Kiran (University of Southern California) | Chintalapoodi, Prakriti (University of Southern California)
This paper focuses on performing Sentiment Analysis of Twitter by adapting data from other domains, commonly referred to as Domain Adaptation. While we show that Domain Adaptation is useful in predicting sentiments, we propose different techniques to select an out-of-domain data source that would aid in Sentiment Analysis. Additionally, we suggest two iterative algorithms based on Expectation-Maximization (EM) and Rocchio SVM that filter noisy data during adaptation and train only on valid data. Finally, we explore a couple of metrics, Mutual Information and Cosine distance to measure similarity between different domains of data. We use Twitter and Blippr as data sources and perform binary sentiment (positive and negative sentiments) classification.
Unsupervised Discovery of Fine-Grained Topic Clusters in Twitter Posts
Markman, Vita (Independent Researcher)
This paper reports on a work in progress whose goal is to use Latent Dirichlet Allocation (LDA) to discover topic clusters within a small set of Twitter posts. Preliminary results indicate that micro-documents are amenable to topic clustering via LDA provided that a) only nouns and verbs are used; b) posts are “padded” with words of similar meaning to those used in the posts. These preliminary findings are consistent with the fact that probabilistic topic models look for word co-occurrences in documents and hence require that topic-indicative words appear together many times throughout the data sample. The results of this pilot study are to be extended to a larger data set of Twitter posts to see whether padding” can counteract the growing size of the data and the presence of numerous information-sparse posts.
The Stock Sonar — Sentiment Analysis of Stocks Based on a Hybrid Approach
Feldman, Ronen (The Hebrew University of Jerusalem) | Rosenfeld, Benjamin (Digital Trowel) | Bar-Haim, Roy (Digital Trowel) | Fresko, Moshe (Digital Trowel)
The Stock Sonar (TSS) is a stock sentiment analysis application based on a novel hybrid approach. While previous work focused on document level sentiment classification, or extracted only generic sentiment at the phrase level, TSS integrates sentiment dictionaries, phrase-level compositional patterns, and predicate-level semantic events. TSS generates precise in text sentiment tagging as well as sentiment-oriented event summaries for a given stock, which are also aggregated into sentiment scores. Hence, TSS allows investors to get the essence of thousands of articles every day and may help them to make timely, informed trading decisions. The extracted sentiment is also shown to improve the accuracy of an existing document-level sentiment classifier.
Transfer Learning for Multiple-Domain Sentiment Analysis — Identifying Domain Dependent/Independent Word Polarity
Yoshida, Yasuhisa (Nara Institute of Science and Technology) | Hirao, Tsutomu (NTT Communication Science Laboratories) | Iwata, Tomoharu (NTT Communication Science Laboratories) | Nagata, Masaaki (NTT Communication Science Laboratories) | Matsumoto, Yuji (Nara Institute of Science and Technology)
Sentiment analysis is the task of determining the attitude (positive or negative) of documents. While the polarity of words in the documents is informative for this task, polarity of some words cannot be determined without domain knowledge. Detecting word polarity thus poses a challenge for multiple-domain sentiment analysis. Previous approaches tackle this problem with transfer learning techniques, but they cannot handle multiple source domains and multiple target domains. This paper proposes a novel Bayesian probabilistic model to handle multiple source and multiple target domains. In this model, each word is associated with three factors: Domain label, domain dependence/independence and word polarity. We derive an efficient algorithm using Gibbs sampling for inferring the parameters of the model, from both labeled and unlabeled texts. Using real data, we demonstrate the effectiveness of our model in a document polarity classification task compared with a method not considering the differences between domains. Moreover our method can also tell whether each word's polarity is domain-dependent or domain-independent. This feature allows us to construct a word polarity dictionary for each domain.
Identifying Evaluative Sentences in Online Discussions
Zhai, Zhongwu (Tsinghua University) | Liu, Bing (University of Illinois at Chicago) | Zhang, Lei (University of Ilinois at Chicago) | Xu, Hua (Tsinghua University) | Jia, Peifa (Tsinghua University)
Much of opinion mining research focuses on product reviews because reviews are opinion-rich and contain little irrelevant information. However, this cannot be said about online discussions and comments. In such postings, the discussions can get highly emotional and heated with many emotional statements, and even personal attacks. As a result, many of the postings and sentences do not express positive or negative opinions about the topic being discussed. To find people’s opinions on a topic and its different aspects, which we call evaluative opinions, those irrelevant sentences should be removed. The goal of this research is thus to identify evaluative opinion sentences. A novel unsupervised approach is proposed to solve the problem, and our experimental results show that it performs well.
A Framework for Incorporating General Domain Knowledge into Latent Dirichlet Allocation Using First-Order Logic
Andrzejewski, David (Lawrence Livermore National Laboratory) | Zhu, Xiaojin (University of Wisconsin-Madison) | Craven, Mark (University of Wisconsin-Madison) | Recht, Benjamin (University of Wisconsin-Madison)
Topic models have been used successfully for a variety of problems, often in the form of application-specific extensions of the basic Latent Dirichlet Allocation (LDA) model. Because deriving these new models in order to encode domain knowledge can be difficult and time-consuming, we propose the Fold·all model, which allows the user to specify general domain knowledge in First-Order Logic (FOL). However, combining topic modeling with FOL can result in inference problems beyond the capabilities of existing techniques. We have therefore developed a scalable inference technique using stochastic gradient descent which may also be useful to the Markov Logic Network (MLN) research community. Experiments demonstrate the expresive power of Fold·all, as well as the scalability of our proposed inference method.