AITopics | Discourse & Dialogue

Collaborating Authors

Discourse & Dialogue

Understanding Language in Conversations "The problems addressed in discourse research aim to answer two general kinds of questions: (1) what information is contained in extended sequences of utterances that goes beyond the meaning of the individual utterances themselves? (2) how does the context in which an utterance is used affect the meaning of the individual utterances, or parts of them?"
– Barbara Grosz. Overview of Chapter 6: Discourse and Dialogue, Survey of the State of the Art in Human Language Technology (1996).

News Overviews Instructional Materials AI-Alerts Classics

Lexical and Hierarchical Topic Regression

Nguyen, Viet-An, Ying, Jordan L., Resnik, Philip

Neural Information Processing SystemsFeb-14-2020, 16:26:27 GMT

Inspired by a two-level theory that unifies agenda setting and ideological framing, we propose supervised hierarchical latent Dirichlet allocation (SHLDA) which jointly captures documents' multi-level topic structure and their polar response variables. Our model extends the nested Chinese restaurant process to discover a tree-structured topic hierarchy and uses both per-topic hierarchical and per-word lexical regression parameters to model the response variables. Experiments in a political domain and on sentiment analysis tasks show that SHLDA improves predictive accuracy while adding a new dimension of insight into how topics under discussion are framed. Papers published at the Neural Information Processing Systems Conference.

lexical and hierarchical topic regression, response variable

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.79)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.71)

Add feedback

On some provably correct cases of variational inference for topic models

Awasthi, Pranjal, Risteski, Andrej

Neural Information Processing SystemsFeb-14-2020, 10:56:10 GMT

Variational inference is an efficient, popular heuristic used in the context of latent variable models. We provide the first analysis of instances where variational inference algorithms converge to the global optimum, in the setting of topic models. Our initializations are natural, one of them being used in LDA-c, the mostpopular implementation of variational inference.In addition to providing intuition into why this heuristic might work in practice, the multiplicative, rather than additive nature of the variational inference updates forces us to usenon-standard proof arguments, which we believe might be of general theoretical interest. Papers published at the Neural Information Processing Systems Conference.

provably correct case, topic model, variational inference

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.69)

Add feedback

A provable SVD-based algorithm for learning topics in dominant admixture corpus

Bansal, Trapit, Bhattacharyya, Chiranjib, Kannan, Ravindran

Neural Information Processing SystemsFeb-14-2020, 09:11:48 GMT

Topic models, such as Latent Dirichlet Allocation (LDA), posit that documents are drawn from admixtures of distributions over words, known as topics. The inference problem of recovering topics from such a collection of documents drawn from admixtures, is NP-hard. Making a strong assumption called separability, [4] gave the first provable algorithm for inference. For the widely used LDA model, [6] gave a provable algorithm using clever tensor-methods. But [4, 6] do not learn topic vectors with bounded $l_1$ error (a natural measure for probability vectors).

algorithm, dominant admixture corpus, provable svd-based algorithm, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.59)

Add feedback

Spectral Methods for Supervised Topic Models

Wang, Yining, Zhu, Jun

Neural Information Processing SystemsFeb-14-2020, 07:58:22 GMT

Supervised topic models simultaneously model the latent topic structure of large collections of documents and a response variable associated with each document. Existing inference methods are based on either variational approximation or Monte Carlo sampling. This paper presents a novel spectral decomposition algorithm to recover the parameters of supervised latent Dirichlet allocation (sLDA) models. The Spectral-sLDA algorithm is provably correct and computationally efficient. We prove a sample complexity bound and subsequently derive a sufficient condition for the identifiability of sLDA.

algorithm, spectral method, supervised topic model, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback

Analysis of Variational Bayesian Latent Dirichlet Allocation: Weaker Sparsity Than MAP

Nakajima, Shinichi, Sato, Issei, Sugiyama, Masashi, Watanabe, Kazuho, Kobayashi, Hiroko

Neural Information Processing SystemsFeb-14-2020, 07:26:39 GMT

Latent Dirichlet allocation (LDA) is a popular generative model of various objects such as texts and images, where an object is expressed as a mixture of latent topics. In this paper, we theoretically investigate variational Bayesian (VB) learning in LDA. More specifically, we analytically derive the leading term of the VB free energy under an asymptotic setup, and show that there exist transition thresholds in Dirichlet hyperparameters around which the sparsity-inducing behavior drastically changes. Then we further theoretically reveal the notable phenomenon that VB tends to induce weaker sparsity than MAP in the LDA model, which is opposed to other models. We experimentally demonstrate the practical validity of our asymptotic theory on real-world Last.FM music data.

variational bayesian latent dirichlet allocation, weaker sparsity

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.67)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.67)

Add feedback

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Zhu, Qi, Zhang, Zheng, Fang, Yan, Li, Xiang, Takanobu, Ryuichi, Li, Jinchao, Peng, Baolin, Gao, Jianfeng, Zhu, Xiaoyan, Huang, Minlie

arXiv.org Artificial IntelligenceFeb-11-2020

We present ConvLab-2, an open-source toolkit that enables researchers to build task-oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation, and diagnose the weakness of systems. As the successor of ConvLab (Lee et al., 2019b), ConvLab-2 inherits ConvLab's framework but integrates more powerful dialogue models and supports more datasets. Besides, we have developed an analysis tool and an interactive tool to assist researchers in diagnosing dialogue systems. The analysis tool presents rich statistics and summarizes common mistakes from simulated dialogues, which facilitates error analysis and system improvement. The interactive tool provides a user interface that allows developers to diagnose an assembled dialogue system by interacting with the system and modifying the output of each system component.

computational linguistic, dialogue act, dialogue system, (14 more...)

arXiv.org Artificial Intelligence

2002.04793

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Denmark > Capital Region > Copenhagen (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
(8 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

Prabhumoye, Shrimai, Li, Margaret, Urbanek, Jack, Dinan, Emily, Kiela, Douwe, Weston, Jason, Szlam, Arthur

arXiv.org Artificial IntelligenceFeb-10-2020

Dialogue research tends to distinguish between chit-chat and goal-oriented tasks. While the former is arguably more naturalistic and has a wider use of language, the latter has clearer metrics and a straightforward learning signal. Humans effortlessly combine the two, for example engaging in chit-chat with the goal of exchanging information or eliciting a specific response. Here, we bridge the divide between these two domains in the setting of a rich multi-player text-based fantasy environment where agents and humans engage in both actions and dialogue. Specifically, we train a goal-oriented model with reinforcement learning against an imitation-learned ``chit-chat'' model with two approaches: the policy either learns to pick a topic or learns to pick an utterance given the top-K utterances from the chit-chat model. We show that both models outperform an inverse model baseline and can converse naturally with their dialogue partner in order to achieve goals.

agent, arxiv preprint arxiv, utterance, (12 more...)

arXiv.org Artificial Intelligence

2002.02878

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Europe > Germany > Saarland > Saarbrücken (0.04)

Genre:

Research Report (0.64)
Personal > Interview (0.46)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Leisure & Entertainment > Sports (0.86)
Leisure & Entertainment > Gambling (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

MA-DST: Multi-Attention Based Scalable Dialog State Tracking

Kumar, Adarsh, Ku, Peter, Goyal, Anuj Kumar, Metallinou, Angeliki, Hakkani-Tur, Dilek

arXiv.org Artificial IntelligenceFeb-7-2020

Task oriented dialog agents provide a natural language interface for users to complete their goal. Dialog State Tracking (DST), which is often a core component of these systems, tracks the system's understanding of the user's goal throughout the conversation. To enable accurate multi-domain DST, the model needs to encode dependencies between past utterances and slot semantics and understand the dialog context, including long-range cross-domain references. We introduce a novel architecture for this task to encode the conversation history and slot semantics more robustly by using attention mechanisms at multiple granularities. In particular, we use cross-attention to model relationships between the context and slots at different semantic levels and self-attention to resolve cross-domain coreferences. In addition, our proposed architecture does not rely on knowing the domain ontologies beforehand and can also be used in a zero-shot setting for new domains or unseen slot values. Our model improves the joint goal accuracy by 5% (absolute) in the full-data setting and by up to 2% (absolute) in the zero-shot setting over the present state-of-the-art on the MultiWoZ 2.1 dataset.

accuracy, conversation history, slot value, (13 more...)

arXiv.org Artificial Intelligence

2002.08898

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry: Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

r/MachineLearning - [D] A Deep Dive into Latent Dirichlet Allocation (LDA) and Its Applications on Recommender System

#artificialintelligenceFeb-5-2020, 18:10:18 GMT

I love LDA but I'm a firm believer that it doesn't have many practical/successful applications in the real world I am actually published on a couple of papers regarding applying LDA at large scale, while working at a large company. It doesn't seem to be a super common approach, and it appears to be really easy to get it wrong. Coupled with the fact that there are other approaches that are about as good, numerically, well, yeah, not a lot of people bother. Doesn't mean it can't work. Anyhow- after we applied a few tricks, the coherence of the topics produced was much (qualitatively) better than clusters being produced by other standard clustering/co-clustering approaches, albeit quantitatively worse.

application, latent dirichlet allocation, recommender system, (3 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology:

Information Technology > Communications > Social Media (0.76)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.40)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.40)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.40)

Add feedback

Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

Gulyaev, Pavel, Elistratova, Eugenia, Konovalov, Vasily, Kuratov, Yuri, Pugachev, Leonid, Burtsev, Mikhail

arXiv.org Machine LearningFeb-5-2020

Dialogue State Tracking (DST) is a core component of virtual assistants such as Alexa or Siri. To accomplish various tasks, these assistants need to support an increasing number of services and APIs. The Schema-Guided State Tracking track of the 8th Dialogue System Technology Challenge highlighted the DST problem for unseen services. The organizers introduced the Schema-Guided Dialogue (SGD) dataset with multi-domain conversations and released a zero-shot dialogue state tracking model. In this work, we propose a GOaL-Oriented Multi-task BERT-based dialogue state tracker (GOLOMB) inspired by architectures for reading comprehension question answering systems. The model "queries" dialogue history with descriptions of slots and services as well as possible values of slots. This allows to transfer slot values in multi-domain dialogues and have a capability to scale to unseen slot types. Our model achieves a joint goal accuracy of 53.97% on the SGD dataset, outperforming the baseline model.

dialogue state, goal accuracy, slot value, (14 more...)

arXiv.org Machine Learning

2002.0245

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
Europe > Belgium (0.04)

Genre: Research Report (0.40)

Industry:

Education > Assessment & Standards > Student Performance (0.68)
Transportation > Ground (0.47)
Consumer Products & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback