Discourse & Dialogue
Discourse Structure Effects on the Global Coherence of Texts
Sagi, Eyal (Northwestern University)
Many theories of discourse structure rely on the idea that the segments comprising the discourse are linked through inferred relations such as causality and temporal contiguity. These theories suggest that the resulting discourse is represented hierarchically. Two experiments examine some of the implications of these hierarchical structures on the perceived coherence of texts. Experiment 1 shows that texts with more levels to their hierarchical structure are judged to be more coherent. Experiment 2 demonstrates that these effects are sensitive to the genre of the text. Specifically, narratives seem to be more affected by manipulation of the discourse structure than procedural texts.
A Platform for Human-Robot Dialog Systems Research
Nielsen, Rodney D. (University of Colorado at Boulder) | Voyles, Richard (University of Denver) | Bolanos, Daniel (Boulder Language Technologies) | Mahoor, Mohammad H. (University of Denver) | Pace, Wilson D. (University of Colorado Denver Anschutz Medical Campus) | Siek, Katie A. (University of Colorado at Boulder) | Ward, Wayne H. (Boulder Language Technologies)
Policy Activation for Open-Ended Dialogue Management
Lison, Pierre (German Research Centre for Artificial Intelligence (DFKI GmbH)) | Kruijff, Geert-Jan M. (German Research Centre for Artificial Intelligence (DFKI)
An important difficulty in developing spoken dialogue systems for robots is the open-ended nature of most interactions. Robotic agents must typically operate in complex, continuously changing environments which are difficult to model and do not provide any clear, predefined goal. Directly capturing this complexity in a single, large dialogue policy is thus inadequate. This paper presents a new approach which tackles the complexity of open-ended interactions by breaking it into a set of small, independent policies, which can be activated and deactivated at runtime by a dedicated mechanism. The approach is currently being implemented in a spoken dialogue system for autonomous robots.
Grounding New Words on the Physical World in Multi-Domain Human-Robot Dialogues
Nakano, Mikio (Honda Research Institute Japan Co., Ltd.) | Iwahashi, Naoto (ATR Media Information Science Research Laboratories / National Institute of Information and Communications Technology) | Nagai, Takayuki (University of Electro-Communications) | Sumii, Taisuke (ATR Media Information Science Research Laboratories / Kyoto Institute of Technology) | Zuo, Xiang (ATR Media Information Science Research Laboratories / Kyoto Institute of Technology) | Taguchi, Ryo (ATR Media Information Science Research Laboratories / Nagoya Institute of Technology) | Nose, Takashi (ATR Media Information Science Research Laboratories / Tokyo Institute of Technology) | Mizutani, Akira (University of Electro-Communications) | Nakamura, Tomoaki (University of Electro-Communications) | Attamim, Muhanmad (University of Electro-Communications) | Narimatsu, Hiromi (University of Electro-Communications) | Funakoshi, Kotaro (Honda Research Institute Japan Co., Ltd.) | Hasegawa, Yuji (Honda Research Institute Japan Co., Ltd.)
This paper summarizes our ongoing project on developing an architecture for a robot that can acquire new words and their meanings while engaging in multi-domain dialogues. These two functions are crucial in making conversational service robots work in real tasks in the real world. Household robots and office robots need to be able to work in multiple task domains and they also need to engage in dialogues in multiple domains corresponding to those task domains. Lexical acquisition is necessary because speech understanding cannot be done without enough knowledge on words that are possibly spoken in the task domain. Our architecture is based on a multi-expert model in which multiple domain experts are employed and one of them is selected based on the user utterance and the situation to engage in the control of the dialogue and physical behaviors. We incorporate experts that have an ability to acquire new lexical entries and their meanings grounded on the physical world through spoken interactions. By appropriately selecting those experts, lexical acquisition in multi-domain dialogues becomes possible. An example robotic system based on this architecture that can acquire object names and location names demonstrates the viability of the architecture.
SenticNet: A Publicly Available Semantic Resource for Opinion Mining
Cambria, Erik (University of Stirling) | Speer, Robyn (Massachusetts Institute of Technology) | Havasi, Catherine (Massachusetts Institute of Technology) | Hussain, Amir (University of Stirling)
Today millions of web-users express their opinions about many topics through blogs, wikis, fora, chats and social networks. For sectors such as e-commerce and e-tourism, it is very useful to automatically analyze the huge amount of social information available on the Web, but the extremely unstructured nature of these contents makes it a difficult task. SenticNet is a publicly available resource for opinion mining built exploiting AI and Semantic Web techniques. It uses dimensionality reduction to infer the polarity of common sense concepts and hence provide a public resource for mining opinions from natural language text at a semantic, rather than just syntactic, level.
Sentiment Extraction: Integrating Statistical Parsing, Semantic Analysis, and Common Sense Reasoning
Shastri, Lokendra (Infosys Technologies Limited) | Parvathy, Anju G. (Infosys Technologies Limited) | Kumar, Abhishek (Infosys Technologies Limited) | Wesley, John (Infosys Technologies Limited) | Blakrishnan, Rajesh (Infosys Technologies Limited)
Much of the ongoing explosion of digital content is in the form of text. This content is a virtual gold-mine of information that can inform a range of social, governmental, and business decisions. For example, using content available on blogs and social networking sites businesses can find out what its customers are saying about their products and services. In the digital age where customer is king, the business value of ascertaining consumer sentiment cannot be overstated. People express sentiments in myriad ways. At times, they use simple, direct assertions, but most often they use sentences involving comparisons, conjunctions expressing multiple and possibly opposing sentiments about multiple features and entities,and pronominal references whose resolution requires discourse level context. Frequently people use abbreviations, slang, SMSese, idioms and metaphors. Understanding the latter also requires common sense reasoning. In this paper, we present iSEE, a fully implemented sentiment extraction engine, which makes use of statistical methods, classical NLU techniques, common sense reasoning, and probabilistic inference to extract entity and feature specific sentiment from complex sentences and dialog. Most of the components of iSEE are domain independent and the system can be generalized to new domains by simply adding domain relevant lexicons.
Sentiment Analysis with Global Topics and Local Dependency
Li, Fangtao (Tsinghua University) | Huang, Minlie (Tsinghua University) | Zhu, Xiaoyan (Tsinghua University)
With the development of Web 2.0, sentiment analysis has now become a popular research problem to tackle. Recently, topic models have been introduced for the simultaneous analysis for topics and the sentiment in a document. These studies, which jointly model topic and sentiment, take the advantage of the relationship between topics and sentiment, and are shown to be superior to traditional sentiment analysis tools. However, most of them make the assumption that, given the parameters, the sentiments of the words in the document are all independent. In our observation, in contrast, sentiments are expressed in a coherent way. The local conjunctive words, such as โandโ or โbutโ, are often indicative of sentiment transitions. In this paper, we propose a major departure from the previous approaches by making two linked contributions. First, we assume that the sentiments are related to the topic in the document, and put forward a joint sentiment and topic model, i.e. Sentiment-LDA. Second, we observe that sentiments are dependent on local context. Thus, we further extend the Sentiment-LDA model to Dependency-Sentiment-LDA model by relaxing the sentiment independent assumption in Sentiment-LDA. The sentiments of words are viewed as a Markov chain in Dependency-Sentiment-LDA. Through experiments, we show that exploiting the sentiment dependency is clearly advantageous, and that the Dependency-Sentiment-LDA is an effective approach for sentiment analysis.
What Is an Opinion About? Exploring Political Standpoints Using Opinion Scoring Model
Chen, Bi (Pennsylvania State University) | Zhu, Leilei (Pennsylvania State University) | Kifer, Daniel (Pennsylvania State University) | Lee, Dongwon (Pennsylvania State University)
In this paper, we propose a generative model to automatically discover the hidden associations between topics words and opinion words. By applying those discovered hidden associations, we construct the opinion scoring models to extract statements which best express opinionistsโ standpoints on certain topics. For experiments, we apply our model to the political area. First, we visualize the similarities and dissimilarities between Republican and Democratic senators with respect to various topics. Second, we compare the performance of the opinion scoring models with 14 kinds of methods to find the best ones. We find that sentences extracted by our opinion scoring models can effectively express opinionistsโ standpoints.
A Two-Dimensional Topic-Aspect Model for Discovering Multi-Faceted Topics
Paul, Michael (University of Illinois at Urbana-Champaign) | Girju, Roxana (University of Illinois at Urbana-Champaign)
This paper presents the Topic-Aspect Model (TAM), a Bayesian mixture model which jointly discovers topics and aspects. We broadly define an aspect of a document as a characteristic that spans the document, such as an underlying theme or perspective. Unlike previous models which cluster words by topic or aspect, our model can generate token assignments in both of these dimensions, rather than assuming words come from only one of two orthogonal models. We present two applications of the model. First, we model a corpus of computational linguistics abstracts, and find that the scientific topics identified in the data tend to include both a computational aspect and a linguistic aspect. For example, the computational aspect of GRAMMAR emphasizes parsing, whereas the linguistic aspect focuses on formal languages. Secondly, we show that the model can capture different viewpoints on a variety of topics in a corpus of editorials about the Israeli-Palestinian conflict. We show both qualitative and quantitative improvements in TAM over two other state-of-the-art topic models.
A Topic Model for Linked Documents and Update Rules for its Estimation
Guo, Zhen (State University of New York at Binghamton) | Zhu, Shenghuo (NEC Laboratories America, Inc.) | Zhang, Zhongfei (State University of New York at Binghamton) | Chi, Yun (NEC Laboratories America, Inc.) | Gong, Yihong (NEC Laboratories America, Inc.)
The latent topic model plays an important role in the unsupervised learning from a corpus, which provides a probabilistic interpretation of the corpus in terms of the latent topic space. An underpinning assumption which most of the topic models are based on is that the documents are assumed to be independent of each other. However, this assumption does not hold true in reality and the relations among the documents are available in different ways, such as the citation relations among the research papers. To address this limitation, in this paper we present a Bernoulli Process Topic (BPT) model, where the interdependence among the documents is modeled by a random Bernoulli process. In the BPT model a document is modeled as a distribution over topics that is a mixture of the distributions associated with the related documents. Although BPT aims at obtaining a better document modeling by incorporating the relations among the documents, it could also be applied to many applications including detecting the topics from corpora and clustering the documents. We apply the BPT model to several document collections and the experimental comparisons against several state-of-the-art approaches demonstrate the promising performance.