Goto

Collaborating Authors

 Discourse & Dialogue


The Role of Data Science in Sentiment Analysis

#artificialintelligence

Understanding individuals' feelings are fundamental for organizations since clients can communicate their feelings and sentiments more transparently than ever before. By automatically analyzing customer feedback, from study reactions to social media discussions, brands can listen mindfully to their clients, and tailor products and services to address their issues. Sentiment analysis is a machine learning method that recognizes polarity (for example a positive or negative thought) within the text, whether a whole document, paragraph, sentence, or clause. Marketing is ending up being one of the artworks most disrupted by the digital revolution. A lot to the aversion of customary marketing proponents and maybe to the pleasure of technologists, it is presently a lot about codifying the whole knowledge chain โ€“ catching the abundance of digital data, sorting out it, applying algorithms to process it and taking care of back noteworthy decisions to different functionsโ€“ all in real-time, with end to end automation, and at lightening quick speed.


From Twitter to Traffic Predictor: Next-Day Morning Traffic Prediction Using Social Media Data

arXiv.org Machine Learning

The effectiveness of traditional traffic prediction methods is often extremely limited when forecasting traffic dynamics in early morning. The reason is that traffic can break down drastically during the early morning commute, and the time and duration of this break-down vary substantially from day to day. Early morning traffic forecast is crucial to inform morning-commute traffic management, but they are generally challenging to predict in advance, particularly by midnight. In this paper, we propose to mine Twitter messages as a probing method to understand the impacts of people's work and rest patterns in the evening/midnight of the previous day to the next-day morning traffic. The model is tested on freeway networks in Pittsburgh as experiments. The resulting relationship is surprisingly simple and powerful. We find that, in general, the earlier people rest as indicated from Tweets, the more congested roads will be in the next morning. The occurrence of big events in the evening before, represented by higher or lower tweet sentiment than normal, often implies lower travel demand in the next morning than normal days. Besides, people's tweeting activities in the night before and early morning are statistically associated with congestion in morning peak hours. We make use of such relationships to build a predictive framework which forecasts morning commute congestion using people's tweeting profiles extracted by 5 am or as late as the midnight prior to the morning. The Pittsburgh study supports that our framework can precisely predict morning congestion, particularly for some road segments upstream of roadway bottlenecks with large day-to-day congestion variation. Our approach considerably outperforms those existing methods without Twitter message features, and it can learn meaningful representation of demand from tweeting profiles that offer managerial insights.


Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems

arXiv.org Artificial Intelligence

Task-oriented dialogue systems are either modularized with separate dialogue state tracking (DST) and management steps or end-to-end trainable. In either case, the knowledge base (KB) plays an essential role in fulfilling user requests. Modularized systems rely on DST to interact with the KB, which is expensive in terms of annotation and inference time. End-to-end systems use the KB directly as input, but they cannot scale when the KB is larger than a few hundred entries. In this paper, we propose a method to embed the KB, of any size, directly into the model parameters. The resulting model does not require any DST or template responses, nor the KB as input, and it can dynamically update its KB via fine-tuning. We evaluate our solution in five task-oriented dialogue datasets with small, medium, and large KB size. Our experiments show that end-to-end models can effectively embed knowledge bases in their parameters and achieve competitive performance in all evaluated datasets.


MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

arXiv.org Artificial Intelligence

In this paper, we propose Minimalist Transfer Learning (MinTL) to simplify the system design process of task-oriented dialogue systems and alleviate the over-dependency on annotated data. MinTL is a simple yet effective transfer learning framework, which allows us to plug-and-play pre-trained seq2seq models, and jointly learn dialogue state tracking and dialogue response generation. Unlike previous approaches, which use a copy mechanism to "carryover" the old dialogue states to the new one, we introduce Levenshtein belief spans (Lev), that allows efficient dialogue state tracking with a minimal generation length. We instantiate our learning framework with two pre-trained backbones: T5 and BART, and evaluate them on MultiWOZ. Extensive experiments demonstrate that: 1) our systems establish new state-of-the-art results on end-to-end response generation, 2) MinTL-based systems are more robust than baseline methods in the low resource setting, and they achieve competitive results with only 20\% training data, and 3) Lev greatly improves the inference efficiency.


Day 123 of #NLP365: NLP Papers Summary -- Context-aware Embedding for Targeted Aspect-basedโ€ฆ

#artificialintelligence

Proposed context-aware embeddings to refine the embeddings of targets and aspects using highly correlative words. A lot of previous work uses context-independent vectors to construct targets and aspects embeddings, which lead to loss of semantic information and failed to capture the interconnection between the specific target, its aspect, and its context. This approach has led to SOTA results in targeted aspect-based sentiment analysis (TABSA). The goal of TABSA is that given an input sentence, we want to extract the sentiment of the aspect that belongs to a target. The refined target embeddings can be computed by multiplying the sentence word embeddings X with the sparse coefficient vector u'.


Sentiment Analysis with NLP using Python and Flask

#artificialintelligence

There are two main features in SA. Sentiment Analysis is widely used in the area of Machine Learning under Natural Language Processing. In this course, you will know how to use sentiment analysis on reviews with the help of a NLP library called TextBlob. You will learn and develop a Flask based WebApp that takes reviews from the user and perform sentiment analysis on the same.


Financial Evolution: AI, Machine Learning & Sentiment Analysis

#artificialintelligence

Artificial Intelligence and Machine Learning (AI & ML) and Sentiment Analysis are said to "predict the future through analysing the past" โ€“ the Holy Grail of the finance sector. They can replicate cognitive decisions made by humans yet avoid the behavioural biases inherent in humans. Processing news data and social media data and classifying (market) sentiment and how it impacts Financial Markets is a growing area of research. The field has recently progressed further with many new "alternative" data sources, such as email receipts, credit/debit card transactions, weather, geo-location, satellite data, Twitter, Micro-blogs and search engine results. AI & ML are gaining adoption in the financial services industry especially in the context of compliance, investment decisions and risk management.


WESSA at SemEval-2020 Task 9: Code-Mixed Sentiment Analysis using Transformers

arXiv.org Artificial Intelligence

In this paper, we describe our system submitted for SemEval 2020 Task 9, Sentiment Analysis for Code-Mixed Social Media Text alongside other experiments. Our best performing system is a Transfer Learning-based model that fine-tunes "XLM-RoBERTa", a transformer-based multilingual masked language model, on monolingual English and Spanish data and Spanish-English code-mixed data. Our system outperforms the official task baseline by achieving a 70.1% average F1-Score on the official leaderboard using the test set. For later submissions, our system manages to achieve a 75.9% average F1-Score on the test set using CodaLab username "ahmed0sultan".


An Improved Approach of Intention Discovery with Machine Learning for POMDP-based Dialogue Management

arXiv.org Artificial Intelligence

An Embodied Conversational Agent (ECA) is an intelligent agent that works as the front end of software applications to interact with users through verbal/nonverbal expressions and to provide online assistance without the limits of time, location, and language. To help to improve the experience of human-computer interaction, there is an increasing need to empower ECA with not only the realistic look of its human counterparts but also a higher level of intelligence. This thesis first highlights the main topics related to the construction of ECA, including different approaches of dialogue management, and then discusses existing techniques of trend analysis for its application in user classification. As a further refinement and enhancement to prior work on ECA, this thesis research proposes a cohesive framework to integrate emotion-based facial animation with improved intention discovery. In addition, a machine learning technique is introduced to support sentiment analysis for the adjustment of policy design in POMDP-based dialogue management. The proposed research work is going to improve the accuracy of intention discovery while reducing the length of dialogues.


A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

arXiv.org Artificial Intelligence

Structured belief states are crucial for user goal tracking and database query in task-oriented dialog systems. However, training belief trackers often requires expensive turn-level annotations of every user utterance. In this paper we aim at alleviating the reliance on belief state labels in building end-to-end dialog systems, by leveraging unlabeled dialog data towards semi-supervised learning. We propose a probabilistic dialog model, called the LAtent BElief State (LABES) model, where belief states are represented as discrete latent variables and jointly modeled with system responses given user inputs. Such latent variable modeling enables us to develop semi-supervised learning under the principled variational learning framework. Furthermore, we introduce LABES-S2S, which is a copy-augmented Seq2Seq model instantiation of LABES. In supervised experiments, LABES-S2S obtains strong results on three benchmark datasets of different scales. In utilizing unlabeled dialog data, semi-supervised LABES-S2S significantly outperforms both supervised-only and semi-supervised baselines. Remarkably, we can reduce the annotation demands to 50% without performance loss on MultiWOZ.