Goto

Collaborating Authors

 Discourse & Dialogue


Task-oriented Dialogue Systems: performance vs. quality-optima, a review

arXiv.org Artificial Intelligence

Task-oriented dialogue systems (TODS) are continuing to rise in popularity as various industries find ways to effectively harness their capabilities, saving both time and money. However, even state-of-the-art TODS are not yet reaching their full potential. TODS typically have a primary design focus on completing the task at hand, so the metric of task-resolution should take priority. Other conversational quality attributes that may point to the success, or otherwise, of the dialogue, may be ignored. This can cause interactions between human and dialogue system that leave the user dissatisfied or frustrated. This paper explores the literature on evaluative frameworks of dialogue systems and the role of conversational quality attributes in dialogue systems, looking at if, how, and where they are utilised, and examining their correlation with the performance of the dialogue system.


STEM: Unsupervised STructural EMbedding for Stance Detection

arXiv.org Artificial Intelligence

Stance detection is an important task, supporting many downstream tasks such as discourse parsing and modeling the propagation of fake news, rumors, and science denial. In this paper, we propose a novel framework for stance detection. Our framework is unsupervised and domain-independent. Given a claim and a multi-participant discussion - we construct the interaction network from which we derive topological embedding for each speaker. These speaker embedding enjoy the following property: speakers with the same stance tend to be represented by similar vectors, while antipodal vectors represent speakers with opposing stances. These embedding are then used to divide the speakers into stance-partitions. We evaluate our method on three different datasets from different platforms. Our method outperforms or is comparable with supervised models while providing confidence levels for its output. Furthermore, we demonstrate how the structural embedding relate to the valence expressed by the speakers. Finally, we discuss some limitations inherent to the framework.


Continual Learning with Knowledge Transfer for Sentiment Classification

arXiv.org Artificial Intelligence

This paper studies continual learning (CL) for sentiment classification (SC). In this setting, the CL system learns a sequence of SC tasks incrementally in a neural network, where each task builds a classifier to classify the sentiment of reviews of a particular product category or domain. Two natural questions are: Can the system transfer the knowledge learned in the past from the previous tasks to the new task to help it learn a better model for the new task? And, can old models for previous tasks be improved in the process as well? This paper proposes a novel technique called KAN to achieve these objectives. KAN can markedly improve the SC accuracy of both the new task and the old tasks via forward and backward knowledge transfer. The effectiveness of KAN is demonstrated through extensive experiments.


Multimodal Interactions Using Pretrained Unimodal Models for SIMMC 2.0

arXiv.org Artificial Intelligence

This paper presents our work on the Situated Interactive MultiModal Conversations 2.0 challenge held at Dialog State Tracking Challenge 10. SIMMC 2.0 includes 4 subtasks, and we introduce our multimodal approaches for the subtask \#1, \#2 and the generation of subtask \#4. SIMMC 2.0 dataset is a multimodal dataset containing image and text information, which is more challenging than the problem of only text-based conversations because it must be solved by understanding the relationship between image and text. Therefore, since there is a limit to solving only text models such as BERT or GPT2, we propose a multimodal model combining image and text. We first pretrain the multimodal model to understand the relationship between image and text, then finetune our model for each task. We achieve the 3rd best performance in subtask \#1, \#2 and a runner-up in the generation of subtask \#4. The source code is available at https://github.com/rungjoo/simmc2.0.


Adapting Document-Grounded Dialog Systems to Spoken Conversations using Data Augmentation and a Noisy Channel Model

arXiv.org Artificial Intelligence

This paper summarizes our submission to Task 2 of the second track of the 10th Dialog System Technology Challenge (DSTC10) "Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations". Similar to the previous year's iteration, the task consists of three subtasks: detecting whether a turn is knowledge seeking, selecting the relevant knowledge document and finally generating a grounded response. This year, the focus lies on adapting the system to noisy ASR transcripts. We explore different approaches to make the models more robust to this type of input and to adapt the generated responses to the style of spoken conversations. For the latter, we get the best results with a noisy channel model that additionally reduces the number of short and generic responses. Our best system achieved the 1st rank in the automatic and the 3rd rank in the human evaluation of the challenge.


Predicting Above-Sentence Discourse Structure using Distant Supervision from Topic Segmentation

arXiv.org Artificial Intelligence

RST-style discourse parsing plays a vital role in many NLP tasks, revealing the underlying semantic/pragmatic structure of potentially complex and diverse documents. Despite its importance, one of the most prevailing limitations in modern day discourse parsing is the lack of large-scale datasets. To overcome the data sparsity issue, distantly supervised approaches from tasks like sentiment analysis and summarization have been recently proposed. Here, we extend this line of research by exploiting distant supervision from topic segmentation, which can arguably provide a strong and oftentimes complementary signal for high-level discourse structures. Experiments on two human-annotated discourse treebanks confirm that our proposal generates accurate tree structures on sentence and paragraph level, consistently outperforming previous distantly supervised models on the sentence-to-document task and occasionally reaching even higher scores on the sentence-to-paragraph level.


Rome's Libraries Readers' Comments Analysis with Deep Learning

#artificialintelligence

This posts describes, along with Python code, an analysis of the readers' comments open dataset from Rome's libraries made publicly available by "Istituzione Biblioteche di Roma"ยน. The analysis leverages topic modeling techniques to find recurring topics among readers' comments, and thus determine, by inference, the themes of the borrowed books and the interests of the readers. Moreover, sentiment analysis is performed to determine whether customers comments are positive or negative. Finally, readers data (age and occupation) are used to achieve customers segmentation via clustering techniques. This provides insights on the topics of borrowed books, the readers sentiment and different readers clusters.


Discourse-Aware Prompt Design for Text Generation

arXiv.org Machine Learning

Current efficient fine-tuning methods (e.g., adapters, prefix-tuning, etc.) have optimized conditional text generation via training a small set of extra parameters of the neural language model, while freezing the rest for efficiency. While showing strong performance on some generation tasks, they don't generalize across all generation tasks. In this work, we show that prompt based conditional text generation can be improved with simple and efficient methods that simulate modeling the discourse structure of human written text. We introduce two key design choices: First we show that a higher-level discourse structure of human written text can be modelled with \textit{hierarchical blocking} on prefix parameters that enable spanning different parts of the input and output text and yield more coherent output generations. Second, we propose sparse prefix tuning by introducing \textit{attention sparsity} on the prefix parameters at different layers of the network and learn sparse transformations on the softmax-function, respectively. We find that sparse attention enables the prefix-tuning to better control of the input contents (salient facts) yielding more efficient tuning of the prefix-parameters. Experiments on a wide-variety of text generation tasks show that structured design of prefix parameters can achieve comparable results to fine-tuning all parameters while outperforming standard prefix-tuning on all generation tasks even in low-resource settings.


Sentiment Analysis

#artificialintelligence

Sentiment analysis is a methodology for analysing text data and classifying the sentiment contained within it. It is a useful technique for every customer facing industry (retail, finance, telco, utilities, etc) which needs to understand how consumers are thinking about them and their products, features and services. Sentiment analysis is a key feature in understanding and predicting churn, developing more accurate customer segmentations and creating recommender systems which have a good take-up of product and service offerings. Today, organisations have access to vast amounts of digital data from multiple platforms, including social media, review platforms, chatbots and influencer marketing campaigns, as well as internal CRM and Enterprise Marketing Systems. This heterogeneous data environment means that multiple types of sentiment model may be needed to truly understand customers, with different models used for understanding emotions, opinions, future intent or what aspects of a product or service are liked or disliked.


Nate Silver savages media study claiming harsher treatment of Biden compared to Trump: 'Complete crap'

FOX News

In media news today, CNN and Chris Cuomo issue scathing statements against each other, the former anchor announces he's leaving his SiriusXM radio show, and a New York Times op-ed gets mocked for fearing free library is contributing to gentrification. Pollster Nate Silver on Monday savaged the analytics behind a recent Washington Post column claiming President Biden was being treated just as badly, or worse, by the media than former President Trump. In the piece published last week, liberal columnist Dana Milbank complained about Biden's media coverage being overly tough and implored journalists to do "soul-searching" and "think about what it is we're delivering to people." In a series of tweets, Silver argued the piece's "sentiment analysis" measuring the positivity and negativity of particular articles written about Trump and Biden was "complete crap," and gave examples to show how the data could be skewed more positively or negatively than it should have been. "To this good thread explaining why the'sentiment analysis' cited in the [Dana Milbank] WaPo article this weekend is complete crap--the analysis was used to make the claim that the press is just negative toward Biden as Trump--I'll also add a couple of comments based on their data," Silver wrote.