Goto

Collaborating Authors

 Discourse & Dialogue


How to Build Emotion Text Analyzer with Python (NLP)

#artificialintelligence

In this tutorial I will guide you on how to detect emotions associated with textual data which can be either classified as either positive or negative and how can you apply that knowledge in variety of applications depending on what you wanna do. In this tutorial I will guide you on how to detect emotions associated with textual data which can be either classified as either positive or negative and how can you apply that knowledge in variety of applications depending on what you wanna do. For instance you want to perform automatic analysis of customer feedback with directly reading them as either positive or negative feedback you will need to Sentiment analyzer to check the negativity or positivity of the textual data.


Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption

arXiv.org Artificial Intelligence

Spoken dialog systems have seen applications in many domains, including medical for automatic conversational diagnosis. State-of-the-art dialog managers are usually driven by deep reinforcement learning models, such as deep Q networks (DQNs), which learn by interacting with a simulator to explore the entire action space since real conversations are limited. However, the DQN-based automatic diagnosis models do not achieve satisfying performances when adapted to new, unseen diseases with only a few training samples. In this work, we propose the Prototypical Q Networks (ProtoQN) as the dialog manager for the automatic diagnosis systems. The model calculates prototype embeddings with real conversations between doctors and patients, learning from them and simulator-augmented dialogs more efficiently. We create both supervised and few-shot learning tasks with the Muzhi corpus. Experiments showed that the ProtoQN significantly outperformed the baseline DQN model in both supervised and few-shot learning scenarios, and achieves state-of-the-art few-shot learning performances.


Theory Interpretations for Topic Models

AAAI Conferences

Many machine learning models have to incorporate latent variables to learn target concepts on training data. The variables are understood only statistically and optimize a statistical property such as likelihood, but usually they are not understood in human understandable semantical terms. An example for such a situation is that of topics in the generative Bayesian model called latent Dirichlet allocation, modelling topics as word distributions from the vocabulary of documents. This paper proposes a framework of classifications and theory interpretations to be used as a construction and analysis tool for exactly such situations. As a proof of concept an algorithm is considered that uses latent Dirichlet allocation topics induced by a corpus to enrich the given sets of RDF annotations on each text of the corpus. The general framework of classifications is used to discuss the role of the algorithm in finding representations of topics by RDF triples.


EmpTransfo: A Multi-Head Transformer Architecture for Creating Empathetic Dialog Systems

AAAI Conferences

Understanding emotions and responding accordingly is one of the biggest challenges of dialog systems. In this paper, we present EmpTransfo, a multi-head Transformer architecture for creating an empathetic dialog system. We show that utilizing the history of emotions and other metadata can improve the quality of generated conversations by the dialog system. EmpTransfo utilizes state-of-the-art pre-trained models (e.g., OpenAI-GPT) for language generation, though models with different sizes can be used. Our experimental results using a challenging language corpus show that the proposed approach outperforms other models in terms of Hit@1 and PPL.


Neural Learning for Aspect Phrase Extraction and Classification in Sentiment Analysis

AAAI Conferences

In this study, we present an approach and a dataset for aspect-based sentiment analysis, showing how we extract and classify aspect phrases. The research field of aspect-based sentiment analysis aims at finding opinions expressed for individual characteristics of products or services in natural language texts. In the literature, reviews for common products or services such as smartphones or restaurants were mostly investigated. We describe our newly annotated dataset of German physician reviews, which presents a sensitive and linguistically complex domain, taking care to describe the annotation process and the functionality of our neural network approach. Finally, we introduce a model that can extract and classify aspect phrases in one step while obtaining an F1 score of 80%. As we employ our algorithm in a more complex domain, we believe that our study outperforms other studies.


Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation

arXiv.org Artificial Intelligence

There is a growing interest in developing goal-oriented dialog systems which serve users in accomplishing complex tasks through multi-turn conversations. Although many methods are devised to evaluate and improve the performance of individual dialog components, there is a lack of comprehensive empirical study on how different components contribute to the overall performance of a dialog system. In this paper, we perform a system-wise evaluation and present an empirical analysis on different types of dialog systems which are composed of different modules in different settings. Our results show that (1) a pipeline dialog system trained using fine-grained supervision signals at different component levels often obtains better performance than the systems that use joint or end-to-end models trained on coarse-grained labels, (2) component-wise, single-turn evaluation results are not always consistent with the overall performance of a dialog system, and (3) despite the discrepancy between simulators and human users, simulated evaluation is still a valid alternative to the costly human evaluation especially in the early stage of development.


Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation

arXiv.org Artificial Intelligence

Dialogue policy optimization often obtains feedback until task completion in task-oriented dialogue systems. This is insufficient for training intermediate dialogue turns since supervision signals (or rewards) are only provided at the end of dialogues. To address this issue, reward learning has been introduced to learn from state-action pairs of an optimal policy to provide turn-by-turn rewards. This approach requires complete state-action annotations of human-to-human dialogues (i.e., expert demonstrations), which is labor intensive. To overcome this limitation, we propose a novel reward learning approach for semi-supervised policy learning. The proposed approach learns a dynamics model as the reward function which models dialogue progress (i.e., state-action sequences) based on expert demonstrations, either with or without annotations. The dynamics model computes rewards by predicting whether the dialogue progress is consistent with expert demonstrations. We further propose to learn action embeddings for a better generalization of the reward function. The proposed approach outperforms competitive policy learning baselines on MultiWOZ, a benchmark multi-domain dataset.


History for Visual Dialog: Do we really need it?

arXiv.org Artificial Intelligence

Visual Dialog involves "understanding" the dialog history (what has been discussed previously) and the current question (what is asked), in addition to grounding information in the image, to generate the correct response. In this paper, we show that co-attention models which explicitly encode dialog history outperform models that don't, achieving state-of-the-art performance (72 % NDCG on val set). However, we also expose shortcomings of the crowd-sourcing dataset collection procedure by showing that history is indeed only required for a small amount of the data and that the current evaluation metric encourages generic replies. To that end, we propose a challenging subset (VisDialConv) of the VisDial val set and provide a benchmark of 63% NDCG.


Adaptive Dialog Policy Learning with Hindsight and User Modeling

arXiv.org Artificial Intelligence

Reinforcement learning methods have been used to compute dialog policies from language-based interaction experiences. Efficiency is of particular importance in dialog policy learning, because of the considerable cost of interacting with people, and the very poor user experience from low-quality conversations. Aiming at improving the efficiency of dialog policy learning, we develop algorithm LHUA (Learning with Hindsight, User modeling, and Adaptation) that, for the first time, enables dialog agents to adaptively learn with hindsight from both simulated and real users. Simulation and hindsight provide the dialog agent with more experience and more (positive) reinforcements respectively. Experimental results suggest that, in success rate and policy quality, LHUA outperforms competitive baselines from the literature, including its no-simulation, no-adaptation, and no-hindsight counterparts.


Building A User-Centric and Content-Driven Socialbot

arXiv.org Artificial Intelligence

To build Sounding Board, we develop a system architecture that is capable of accommodating dialog strategies that we designed for socialbot conversations. The architecture consists of a multi-dimensional language understanding module for analyzing user utterances, a hierarchical dialog management framework for dialog context tracking and complex dialog control, and a language generation process that realizes the response plan and makes adjustments for speech synthesis. Additionally, we construct a new knowledge base to power the socialbot by collecting social chat content from a variety of sources. An important contribution of the system is the synergy between the knowledge base and the dialog management, i.e., the use of a graph structure to organize the knowledge base that makes dialog control very efficient in bringing related content to the discussion. Using the data collected from Sounding Board during the competition, we carry out in-depth analyses of socialbot conversations and user ratings which provide valuable insights in evaluation methods for socialbots. We additionally investigate a new approach for system evaluation and diagnosis that allows scoring individual dialog segments in the conversation. Finally, observing that socialbots suffer from the issue of shallow conversations about topics associated with unstructured data, we study the problem of enabling extended socialbot conversations grounded on a document. To bring together machine reading and dialog control techniques, a graph-based document representation is proposed, together with methods for automatically constructing the graph. Using the graph-based representation, dialog control can be carried out by retrieving nodes or moving along edges in the graph. To illustrate the usage, a mixed-initiative dialog strategy is designed for socialbot conversations on news articles.