Discourse & Dialogue
Accountable Error Characterization
Misra, Amita, Liu, Zhe, Mahmud, Jalal
Customers of machine learning systems demand accountability from the companies employing these algorithms for various prediction tasks. Accountability requires understanding of system limit and condition of erroneous predictions, as customers are often interested in understanding the incorrect predictions, and model developers are absorbed in finding methods that can be used to get incremental improvements to an existing system. Therefore, we propose an accountable error characterization method, AEC, to understand when and where errors occur within the existing black-box models. AEC, as constructed with human-understandable linguistic features, allows the model developers to automatically identify the main sources of errors for a given classification system. It can also be used to sample for the set of most informative input points for a next round of training. We perform error detection for a sentiment analysis task using AEC as a case study. Our results on the sample sentiment task show that AEC is able to characterize erroneous predictions into human understandable categories and also achieves promising results on selecting erroneous samples when compared with the uncertainty-based sampling.
Researchers develop artificial intelligence that can detect sarcasm in social media
Computer science researchers at the University of Central Florida have developed a sarcasm detector. Social media has become a dominant form of communication for individuals, and for companies looking to market and sell their products and services. Properly understanding and responding to customer feedback on Twitter, Facebook and other social media platforms is critical for success, but it is incredibly labor intensive. That's where sentiment analysis comes in. The term refers to the automated process of identifying the emotion--either positive, negative or neutral--associated with text.
Explaining Outcomes of Multi-Party Dialogues using Causal Learning
Sinha, Priyanka, Mitra, Pabitra, da Costa, Antonio Anastasio Bruto, Kekatos, Nikolaos
Multi-party dialogues are common in enterprise social media on technical as well as non-technical topics. The outcome of a conversation may be positive or negative. It is important to analyze why a dialogue ends with a particular sentiment from the point of view of conflict analysis as well as future collaboration design. We propose an explainable time series mining algorithm for such analysis. A dialogue is represented as an attributed time series of occurrences of keywords, EMPATH categories, and inferred sentiments at various points in its progress. A special decision tree, with decision metrics that take into account temporal relationships between dialogue events, is used for predicting the cause of the outcome sentiment. Interpretable rules mined from the classifier are used to explain the prediction. Experimental results are presented for the enterprise social media posts in a large company.
Intelligent Conversational Android ERICA Applied to Attentive Listening and Job Interview
Kawahara, Tatsuya, Inoue, Koji, Lala, Divesh
Following the success of spoken dialogue systems (SDS) in smartphone assistants and smart speakers, a number of communicative robots are developed and commercialized. Compared with the conventional SDSs designed as a human-machine interface, interaction with robots is expected to be in a closer manner to talking to a human because of the anthropomorphism and physical presence. The goal or task of dialogue may not be information retrieval, but the conversation itself. In order to realize human-level "long and deep" conversation, we have developed an intelligent conversational android ERICA. We set up several social interaction tasks for ERICA, including attentive listening, job interview, and speed dating. To allow for spontaneous, incremental multiple utterances, a robust turn-taking model is implemented based on TRP (transition-relevance place) prediction, and a variety of backchannels are generated based on time frame-wise prediction instead of IPU-based prediction. We have realized an open-domain attentive listening system with partial repeats and elaborating questions on focus words as well as assessment responses. It has been evaluated with 40 senior people, engaged in conversation of 5-7 minutes without a conversation breakdown. It was also compared against the WOZ setting. We have also realized a job interview system with a set of base questions followed by dynamic generation of elaborating questions. It has also been evaluated with student subjects, showing promising results.
Analyzing Sentiment Using Vader
Vader stands for Valence Aware Dictionary and sEntiment Reasoner. It is a lexicon and rule based tool for sentiment analysis. It is specifically attuned to sentiments expressed in social media. It is used for analyzing the sentiment of text which contains both positive and negative polarity. The main function of VADER is to quantify how much of positive or negative emotion is present in the text. It can also measure the intensity of emotion.
Interventional Aspect-Based Sentiment Analysis
Bi, Zhen, Zhang, Ningyu, Ye, Ganqiang, Yu, Haiyang, Chen, Xi, Chen, Huajun
Recent neural-based aspect-based sentiment analysis approaches, though achieving promising improvement on benchmark datasets, have reported suffering from poor robustness when encountering confounder such as non-target aspects. In this paper, we take a causal view to addressing this issue. We propose a simple yet effective method, namely, Sentiment Adjustment Figure 1: The causal graph of ABSA. We build our (SENTA), by applying a backdoor adjustment causal model over three main variables: target feature to disentangle those confounding factors. X, predictions Y and confounding factor C. Experimental results on the Aspect Robustness Our goal is to alleviate confounding factors, which is Test Set (ARTS) dataset demonstrate caused by X C, Y C. that our approach improves the performance while maintaining accuracy in the original test set
Few-shot Learning for Topic Modeling
Topic models have been successfully used for analyzing text documents. However, with existing topic models, many documents are required for training. In this paper, we propose a neural network-based few-shot learning method that can learn a topic model from just a few documents. The neural networks in our model take a small number of documents as inputs, and output topic model priors. The proposed method trains the neural networks such that the expected test likelihood is improved when topic model parameters are estimated by maximizing the posterior probability using the priors based on the EM algorithm. Since each step in the EM algorithm is differentiable, the proposed method can backpropagate the loss through the EM algorithm to train the neural networks. The expected test likelihood is maximized by a stochastic gradient descent method using a set of multiple text corpora with an episodic training framework. In our experiments, we demonstrate that the proposed method achieves better perplexity than existing methods using three real-world text document sets.
A recipe for annotating grounded clarifications
Benotti, Luciana, Blackburn, Patrick
In Clarifications are crucial to robust dialogues, and Sections 4 and A we test the practical implications pragmatic factors -- notably those shaped by the of our recipe by identifying and characterizing (according world modalities situating the conversation -- have to their modalities) the clarifications in a a key role to play. Referring expressions have in corpus of long dialogues in English. In Section 5 vision a modality in which to ground clarifications we turn to the claim that clarifications are rare in concerning objects in the world (de Vries et al., dialogue datasets (Ginzburg, 2012), and that current 2017); navigation instructions have in movement data-hungry algorithms cannot learn them. We a modality in which to ground clarifications concerning argue that whether they are rare or not depends collaborative wayfinding (Thomason et al., on pragmatic factors of the conversation and the 2019). Clarifications grounded in situationally relevant modality of the grounded clarification, and discuss modalities boost the redundancy required to the impact of six such factors. After presenting learn to use language without explicit supervision, potential objections and our responses in Section 6, as they make explicit the process of negotiating the we conclude in Section 7 by noting ethical issues communicative intent. But despite its importance, raised by socioperceptive dialogue systems that work on clarification remains scattered.
Variational Weakly Supervised Sentiment Analysis with Posterior Regularization
Sentiment analysis is an important task in natural language processing (NLP). Most of existing state-of-the-art methods are under the supervised learning paradigm. However, human annotations can be scarce. Thus, we should leverage more weak supervision for sentiment analysis. In this paper, we propose a posterior regularization framework for the variational approach to the weakly supervised sentiment analysis to better control the posterior distribution of the label assignment. The intuition behind the posterior regularization is that if extracted opinion words from two documents are semantically similar, the posterior distributions of two documents should be similar. Our experimental results show that the posterior regularization can improve the original variational approach to the weakly supervised sentiment analysis and the performance is more stable with smaller prediction variance.
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Kottur, Satwik, Moon, Seungwhan, Geramifard, Alborz, Damavandi, Babak
We present a new corpus for the Situated and Interactive Multimodal Conversations, SIMMC 2.0, aimed at building a successful multimodal assistant agent. Specifically, the dataset features 11K task-oriented dialogs (117K utterances) between a user and a virtual assistant on the shopping domain (fashion and furniture), grounded in situated and photo-realistic VR scenes. The dialogs are collected using a two-phase pipeline, which first generates simulated dialog flows via a novel multimodal dialog simulator we propose, followed by manual paraphrasing of the generated utterances. In this paper, we provide an in-depth analysis of the collected dataset, and describe in detail the four main benchmark tasks we propose for SIMMC 2.0. The preliminary analysis with a baseline model highlights the new challenges that the SIMMC 2.0 dataset brings, suggesting new directions for future research. Our dataset and code will be made publicly available.