AITopics | Discourse & Dialogue

Discourse & Dialogue

Understanding Language in Conversations "The problems addressed in discourse research aim to answer two general kinds of questions: (1) what information is contained in extended sequences of utterances that goes beyond the meaning of the individual utterances themselves? (2) how does the context in which an utterance is used affect the meaning of the individual utterances, or parts of them?"
– Barbara Grosz. Overview of Chapter 6: Discourse and Dialogue, Survey of the State of the Art in Human Language Technology (1996).

Dialogue-based simulation for cultural awareness training

Adewole, Sodiq, Gharavi, Erfaneh, Shpringer, Benjamin, Bolger, Martin, Sharma, Vaibhav, Yang, Sung Ming, Brown, Donald E.

arXiv.org Artificial Intelligence Feb-1-2020

Existing simulations designed for cultural and interpersonal skill training rely on pre-defined responses with a menu option selection interface. Using a multiple-choice interface and restricting trainees' responses may limit the trainees' ability to apply the lessons in real-life situations. These systems also use a simplistic evaluation model, where trainees' selected options are marked as either correct or incorrect. This model may not capture sufficient information to drive an adaptive feedback mechanism that improves trainees' cultural awareness. This paper describes the design of a dialogue-based simulation for cultural awareness training. The simulation is built around a disaster management scenario involving a joint coalition between the US and Chinese armies. Trainees were able to engage in realistic dialogue with the Chinese agent. Their responses, at different points, are evaluated by different multi-label classification models. Trained on our dataset, the models score the trainees' responses for awareness of Chinese culture. Trainees also get feedback on the cultural appropriateness of their responses. The results of this work showed the following: (i) a feature-based evaluation model improves the design, modeling and computation of dialogue-based training simulation systems; (ii) output from current automatic speech recognition (ASR) systems gave end results comparable to those from manual transcription; (iii) a multi-label classification model trained as a cultural expert gave results comparable to scores assigned by human annotators.

machine learning, natural language, simulation, (18 more...)

arXiv.org Artificial Intelligence

2002.00223

Country:

North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.70)
Government > Military > Army (0.68)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Adversarial Training for Aspect-Based Sentiment Analysis with BERT

Karimi, Akbar, Rossi, Leonardo, Prati, Andrea, Full, Katharina

arXiv.org Machine Learning Jan-31-2020

Aspect-Based Sentiment Analysis (ABSA) deals with the extraction of sentiments and their targets. Collecting labeled data for this task to help neural networks generalize better can be laborious and time-consuming. As an alternative, data similar to real-world examples can be produced artificially through an adversarial process carried out in the embedding space. Although these examples are not real sentences, they have been shown to act as a regularization method that can make neural networks more robust. In this work, we apply adversarial training, which was put forward by Goodfellow et al. (2014), to the post-trained BERT (BERT-PT) language model proposed by Xu et al. (2019) on the two major tasks of Aspect Extraction and Aspect Sentiment Classification in sentiment analysis. After improving the results of post-trained BERT by an ablation study, we propose a novel architecture called BERT Adversarial Training (BAT) to utilize adversarial training in ABSA. The proposed model outperforms post-trained BERT in both tasks. To the best of our knowledge, this is the first study on the application of adversarial training in ABSA.

aspect-based sentiment analysis, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

2001.11316

Country:

North America > Canada (0.04)
Europe > Italy (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Europe > Germany (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Improving Interaction Quality Estimation with BiLSTMs and the Impact on Dialogue Policy Learning

Ultes, Stefan

arXiv.org Artificial Intelligence Jan-21-2020

Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has been a focus of research for many years. While most work based on reinforcement learning employs an objective measure like task success for modelling the reward signal, we use a reward based on user satisfaction estimation. We propose a novel estimator and show that it outperforms all previous estimators while learning temporal dependencies implicitly. Furthermore, we apply this novel user satisfaction estimation model live in simulated experiments where the satisfaction estimation model is trained on one domain and applied in many other domains that cover a similar task. We show that applying this model results in higher estimated satisfaction, similar task success rates and a higher robustness to noise.

computational linguistic, dialogue, estimator, (13 more...)

arXiv.org Artificial Intelligence

2001.07615

Country:

North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Germany (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Keyword-based Topic Modeling and Keyword Selection

Wang, Xingyu, Zhang, Lida, Klabjan, Diego

arXiv.org Machine Learning Jan-21-2020

Certain types of documents, such as tweets, are collected by specifying a set of keywords. As topics of interest change with time, it is beneficial to adjust keywords dynamically. The challenge is that these need to be specified ahead of knowing the forthcoming documents and the underlying topics. The future topics should mimic past topics of interest, yet there should be some novelty in them. We develop a keyword-based topic model that dynamically selects a subset of keywords to be used to collect future documents. The generative process first selects keywords and then the underlying documents based on the specified keywords. The model is trained by using a variational lower bound and stochastic gradient optimization. The inference consists of finding a subset of keywords; given a subset, the model predicts the underlying topic-word matrix for the unknown forthcoming documents. We compare the keyword topic model against a benchmark model using viral predictions of tweets combined with a topic model. The keyword-based topic model outperforms this sophisticated baseline model by 67%.

candidate keyword, keyword, tweet, (15 more...)

arXiv.org Machine Learning

2001.07866

Country:

North America > United States > Kentucky (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Illinois > Cook County > Evanston (0.04)
(2 more...)

Genre:

Research Report (0.82)
Overview (0.67)

Industry: Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Optimal estimation of sparse topic models

Bing, Xin, Bunea, Florentina, Wegkamp, Marten

arXiv.org Machine Learning Jan-21-2020

Topic models have become popular tools for dimension reduction and exploratory analysis of text data, which consists of observed frequencies of a vocabulary of $p$ words in $n$ documents, stored in a $p\times n$ matrix. The main premise is that the mean of this data matrix can be factorized into a product of two non-negative matrices: a $p\times K$ word-topic matrix $A$ and a $K\times n$ topic-document matrix $W$. This paper studies the estimation of $A$ when it is possibly element-wise sparse and the number of topics $K$ is unknown. In this under-explored context, we derive a new minimax lower bound for the estimation of such $A$ and propose a new computationally efficient algorithm for its recovery. We derive a finite sample upper bound for our estimator, and show that it matches the minimax lower bound in many scenarios. Our estimate adapts to the unknown sparsity of $A$ and our analysis is valid for any finite $n$, $p$, $K$ and document lengths. Empirical results on both synthetic data and semi-synthetic data show that our proposed estimator is a strong competitor of the existing state-of-the-art algorithms for both non-sparse $A$ and sparse $A$, and has superior performance in many scenarios of interest.
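
To make the factorization in the abstract concrete, the following is a minimal sketch of the product $A W$ for a toy vocabulary. It is not the paper's estimator. The matrix values, the sizes ($p=3$, $K=2$, $n=2$), the helper `matmul`, and the convention that each column of $A$ and $W$ sums to 1 are all assumptions made for illustration.

```python
# Toy illustration of the topic-model premise: the expected word-frequency
# matrix is the product of a word-topic matrix A (p x K) and a
# topic-document matrix W (K x n). All numbers are made up.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    rows, inner, cols = len(a), len(b), len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(rows)
    ]

# A: p = 3 words, K = 2 topics. Each column is a distribution over words
# (assumed convention). The zero entries make A element-wise sparse.
A = [
    [0.5, 0.0],
    [0.5, 0.2],
    [0.0, 0.8],
]

# W: K = 2 topics, n = 2 documents. Each column is a document's topic mix.
W = [
    [1.0, 0.25],
    [0.0, 0.75],
]

# Pi: p x n matrix of expected word frequencies per document.
Pi = matmul(A, W)

# Under the assumed normalization, each column of Pi is again a
# probability vector over the vocabulary.
column_sums = [sum(Pi[i][j] for i in range(len(Pi))) for j in range(len(Pi[0]))]
```

The estimation problem described in the abstract runs in the opposite direction: only noisy word counts are observed, and $A$ has to be recovered from them without knowing $K$.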

algorithm, anchor word, estimation, (17 more...)

arXiv.org Machine Learning

2001.07861

Country:

North America > United States > New York > Tompkins County > Ithaca (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.85)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.45)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.45)

Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems

Balaraman, Vevake, Magnini, Bernardo

arXiv.org Artificial Intelligence Jan-21-2020

In such systems the dialogue state tracker (DST) is a core component, which aims to maintain a distribution over the dialogue states based on the dialogue history. A dialogue state at any turn t in the dialogue is typically represented as a set of slot-value pairs, such as (price, moderate) or (food, italian) in the context of restaurant reservation. The goal of the DST is to determine the user's intent and the user's goal during the dialogue and represent them as such slot-value pairs. The downstream components of a dialogue system (e.g., the dialogue manager), which are responsible for choosing the next system action, rely on an accurate DST for an effective dialogue strategy. Because of the importance of DST in dialogue systems, its development has attracted much research in both academia and industry. Typical dialogue systems are modeled for a fixed ontology consisting of a single domain (Mrkšić et al. 2017; Zhong, Xiong, and Socher 2018; Ren et al. 2018), and the domain ontology schema defines intents, slots and values for each slot of the domain.
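
The slot-value representation described above can be sketched in a few lines. This is a deliberately simplified, deterministic illustration. It keeps a single best state rather than the distribution over states that the abstract mentions, and it is not the authors' tracker. The function name `update_state` and the example turns are hypothetical.

```python
from typing import Dict, List, Tuple

# A dialogue state is modelled here as a mapping from slot name to value,
# e.g. {"price": "moderate", "food": "italian"}.
DialogueState = Dict[str, str]


def update_state(
    state: DialogueState, turn_slots: List[Tuple[str, str]]
) -> DialogueState:
    """Return a new state with the slot-value pairs of the latest turn applied.

    Later mentions of a slot overwrite earlier ones. A real tracker would
    have to infer these pairs from the user's utterance.
    """
    new_state = dict(state)  # copy so earlier turns stay inspectable
    for slot, value in turn_slots:
        new_state[slot] = value
    return new_state


# Hypothetical restaurant-reservation dialogue, one list of pairs per turn.
state: DialogueState = {}
state = update_state(state, [("price", "moderate")])
state = update_state(state, [("food", "italian")])
state = update_state(state, [("price", "cheap")])  # the user changes their mind
```

After the three turns the state holds one value per slot, with `price` reflecting the most recent turn. A downstream dialogue manager would read this state to choose the next system action.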

dataset, dialogue, representation, (12 more...)

arXiv.org Artificial Intelligence

2001.07526

Country:

Europe > Italy > Trentino-Alto Adige/Südtirol > Trentino Province > Trento (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Pennsylvania (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Unsupervised Sentiment Analysis for Code-mixed Data

Yadav, Siddharth, Chakraborty, Tanmoy

arXiv.org Artificial Intelligence Jan-20-2020

Code-mixing is the practice of alternating between two or more languages. Mostly observed in multilingual societies, its occurrence is increasing, and so is its importance. A major part of sentiment analysis research has been monolingual, and most existing methods perform poorly on code-mixed text. In this work, we introduce methods that use different kinds of multilingual and cross-lingual embeddings to efficiently transfer knowledge from monolingual text to code-mixed text for sentiment analysis. Our methods can handle code-mixed text through zero-shot learning. Our methods beat the state of the art on English-Spanish code-mixed sentiment analysis by an absolute 3% F1-score. We are able to achieve 0.58 F1-score (without parallel corpus) and 0.62 F1-score (with parallel corpus) on the same benchmark in a zero-shot way, as compared to 0.68 F1-score in supervised settings. Our code is publicly available.

code-mixed text, computational linguistic, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2001.11384

Country:

Europe > Italy > Tuscany > Florence (0.05)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > Indonesia > Bali (0.04)
(14 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
(2 more...)

10 Important Research Papers In Conversational AI From 2019

#artificialintelligence Jan-17-2020, 03:21:07 GMT

Conversational AI is becoming an integral part of business practice across industries. More companies are adopting the advantages chatbots bring to customer service, sales, and marketing. Even though chatbots are becoming a "must-have" asset for leading businesses, their performance is still very far from human. Researchers from major research institutions and tech leaders have explored ways to boost the performance of dialog systems by increasing the diversity of their responses, enabling emotion recognition, improving their ability to track long-term aspects of the conversation, ensuring the maintenance of a consistent persona, etc. We've searched through important conversational AI research papers published in 2019 to present you with the top 10 that set the new state of the art in both task-oriented and open-domain dialog systems. Subscribe to our AI Research mailing list at the bottom of this article to be alerted when we release new summaries.

dataset, dialog system, evaluation, (15 more...)

#artificialintelligence

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.74)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.70)

Plato Dialogue System: A Flexible Conversational AI Research Platform

Papangelis, Alexandros, Namazifar, Mahdi, Khatri, Chandra, Wang, Yi-Chia, Molino, Piero, Tur, Gokhan

arXiv.org Artificial Intelligence Jan-17-2020

As the field of Spoken Dialogue Systems and Conversational AI grows, so does the need for tools and environments that abstract away implementation details in order to expedite the development process, lower the barrier of entry to the field, and offer a common test-bed for new ideas. In this paper, we present Plato, a flexible Conversational AI platform written in Python that supports any kind of conversational agent architecture, from standard architectures to architectures with jointly-trained components, single- or multi-party interactions, and offline or online training of any conversational agent component. Plato has been designed to be easy to understand and debug and is agnostic to the underlying learning frameworks that train each component.

agent, conversational agent, plato, (16 more...)

arXiv.org Artificial Intelligence

2001.06463

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > France (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Education > Educational Setting > Online (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Agile Testing Days USA June 21–25, 2020

#artificialintelligence Jan-16-2020, 18:50:48 GMT

How do you test an application that constantly listens to customers, learns their behaviour, and creates personalised engagements based on those learnings? Today data plays a vital role in every decision, so making sense of the data to derive useful insights for our customers is key to success. Sentiment analysis is the process of classifying data as positive, negative, or neutral. It is implemented using natural language processing (NLP) and machine learning techniques, and it helps in analysing data to gauge public opinion, conduct market research, monitor brand and product reputation, and understand customer experiences. It is mostly offered as Sentiment Analysis as-a-Service. In this talk we will discuss the challenges around analysing explicit and implicit opinions, sarcasm, comparative opinions, multilingual text, emojis, and the definition of neutral, to name just a few, and the strategies to test such applications, with a use case on airline sentiment (a model trained on tweets about airlines to distinguish between positive, neutral, and negative tweets).

application, customer, sentiment analysis, (1 more...)

#artificialintelligence

Country: North America > United States (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.92)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.92)
