Discourse & Dialogue
Real-Time Stock News Sentiment Analyzer
Investing in the Stock Market is a great way of tackling Inflation. Inflation refers to the rise in the prices of most goods and services of daily or common use, such as food, clothing, housing, recreation, transport, consumer staples, etc. Basically, with 100 rupees you won't be able to buy as many vada pavs (wadapavs) as you could last year. In the pandemic-struck financial year of 2020–2021 a whopping 142 lakh new investors have started trading in the stock market. One key skill required to make good investments in the stock market is being able to correctly analyze news related to the finance and the business sector. Which company is diversifying its sectors or which company is showing signs of heading towards bankruptcy?
Modeling Performance in Open-Domain Dialogue with PARADISE
Walker, Marilyn, Harmon, Colin, Graupera, James, Harrison, Davan, Whittaker, Steve
There has recently been an explosion of work on spoken dialogue systems, along with an increased interest in open-domain systems that engage in casual conversations on popular topics such as movies, books and music. These systems aim to socially engage, entertain, and even empathize with their users. Since the achievement of such social goals is hard to measure, recent research has used dialogue length or human ratings as evaluation metrics, and developed methods for automatically calculating novel metrics, such as coherence, consistency, relevance and engagement. Here we develop a PARADISE model for predicting the performance of Athena, a dialogue system that has participated in thousands of conversations with real users, while competing as a finalist in the Alexa Prize. We use both user ratings and dialogue length as metrics for dialogue quality, and experiment with predicting these metrics using automatic features that are both system dependent and independent. Our goal is to learn a general objective function that can be used to optimize the dialogue choices of any Alexa Prize system in real time and evaluate its performance. Our best model for predicting user ratings gets an R$^2$ of .136 with a DistilBert model, and the best model for predicting length with system independent features gets an R$^2$ of .865, suggesting that conversation length may be a more reliable measure for automatic training of dialogue systems.
Sentiment Analysis with Scikit-learn and GCP
For this project, I wanted to design a model that would do a simple classification of whether a phrase is positive or negative. Since I'm only looking for a binary result, I chose to use Sklearn's logistic regression module. If you were trying to predict more than two labels, you would have to use a different ML model. The data used is a corpus of 5,000 movie reviews -- 2,500 positive and 2,500 negative. The model has an accuracy of 90% and probably performs better with text that is similar to a review because it would more like the training data.
Distributionally Robust Classifiers in Sentiment Analysis
Li, Shilun, Li, Renee, Zhang, Carina
In this paper, we propose sentiment classification models based on BERT integrated with DRO (Distributionally Robust Classifiers) to improve model performance on datasets with distributional shifts. We added 2-Layer Bi-LSTM, projection layer (onto simplex or Lp ball), and linear layer on top of BERT to achieve distributionally robustness. We considered one form of distributional shift (from IMDb dataset to Rotten Tomatoes dataset). We have confirmed through experiments that our DRO model does improve performance on our test set with distributional shift from the training set.
The R package sentometrics to compute, aggregate and predict with textual sentiment
Ardia, David, Bluteau, Keven, Borms, Samuel, Boudt, Kris
We provide a hands-on introduction to optimized textual sentiment indexation using the R package sentometrics. Textual sentiment analysis is increasingly used to unlock the potential information value of textual data. The sentometrics package implements an intuitive framework to efficiently compute sentiment scores of numerous texts, to aggregate the scores into multiple time series, and to use these time series to predict other variables. The workflow of the package is illustrated with a built-in corpus of news articles from two major U.S. journals to forecast the CBOE Volatility Index.
Introducing Myself
I decided to sign up to Medium "by the other side" with the aim to publish my AI for Finance projects and empower my knowledge in these sectors, thanks to this great community! I love to analyse stocks and alternative assets prices and making inference using regressions, ensemble methods and sentiment analysis. I may still be not so capable of using Medium but I'll give it a shot! I'm so excited to start this!
Improving Compositional Generalization with Self-Training for Data-to-Text Generation
Mehta, Sanket Vaibhav, Rao, Jinfeng, Tay, Yi, Kale, Mihir, Parikh, Ankur, Zhong, Hongtao, Strubell, Emma
Data-to-text generation focuses on generating fluent natural language responses from structured semantic representations. Such representations are compositional, allowing for the combination of atomic meaning schemata in various ways to express the rich semantics in natural language. Recently, pretrained language models (LMs) have achieved impressive results on data-to-text tasks, though it remains unclear the extent to which these LMs generalize to new semantic representations. In this work, we systematically study the compositional generalization of current state-of-the-art generation models in data-to-text tasks. By simulating structural shifts in the compositional Weather dataset, we show that T5 models fail to generalize to unseen structures. Next, we show that template-based input representations greatly improve the model performance and model scale does not trivially solve the lack of generalization. To further improve the model's performance, we propose an approach based on self-training using finetuned BLEURT for pseudo-response selection. Extensive experiments on the few-shot Weather and multi-domain SGD datasets demonstrate strong gains of our method.
Sentimental Analysis in Machine Learning
Sentimental Analysis helps in quickly analyzing the numerous amount of data. Since Artificial Intelligence and its advanced technologies have started influencing different sectors. A lot of research work is taking place for developing different tools that can evolve Artificial Intelligence and Machine Learning more stronger. And Sentiment Analysis is one such topic that has created a buzz in the field of scientific and market research in the field of Natural Language Processing and Machine Learning with the help of its amazing applications. Basically, Sentiment Analysis is a Machine Learning tool.
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
Madotto, Andrea, Lin, Zhaojiang, Winata, Genta Indra, Fung, Pascale
Learning to converse using only a few examples is a great challenge in conversational AI. The current best conversational models, which are either good chit-chatters (e.g., BlenderBot) or goal-oriented systems (e.g., MinTL), are language models (LMs) fine-tuned on large conversational datasets. Training these models is expensive, both in terms of computational resources and time, and it is hard to keep them up to date with new conversational skills. A simple yet unexplored solution is prompt-based few-shot learning (Brown et al. 2020) which does not require gradient-based fine-tuning but instead uses a few examples in the LM context as the only source of learning. In this paper, we explore prompt-based few-shot learning in dialogue tasks. We benchmark LMs of different sizes in nine response generation tasks, which include four knowledge-grounded tasks, a task-oriented generations task, three open-chat tasks, and controlled stylistic generation, and five conversational parsing tasks, which include dialogue state tracking, graph path generation, persona information extraction, document retrieval, and internet query generation. The current largest released LM (GPT-J-6B) using prompt-based few-shot learning, and thus requiring no training, achieves competitive performance to fully trained state-of-the-art models. Moreover, we propose a novel prompt-based few-shot classifier, that also does not require any fine-tuning, to select the most appropriate prompt given a dialogue history. Finally, by combining the power of prompt-based few-shot learning and a Skill Selector, we create an end-to-end chatbot named the Few-Shot Bot (FSB), which automatically selects the most appropriate conversational skill, queries different knowledge bases or the internet, and uses the retrieved knowledge to generate a human-like response, all using only few dialogue examples per skill.
GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems
Ding, Bosheng, Hu, Junjie, Bing, Lidong, Aljunied, Sharifah Mahani, Joty, Shafiq, Si, Luo, Miao, Chunyan
Much recent progress in task-oriented dialogue (ToD) systems has been driven by available annotation data across multiple domains for training. Over the last few years, there has been a move towards data curation for multilingual ToD systems that are applicable to serve people speaking different languages. However, existing multilingual ToD datasets either have a limited coverage of languages due to the high cost of data curation, or ignore the fact that dialogue entities barely exist in countries speaking these languages. To tackle these limitations, we introduce a novel data curation method that generates GlobalWoZ -- a large-scale multilingual ToD dataset globalized from an English ToD dataset for three unexplored use cases. Our method is based on translating dialogue templates and filling them with local entities in the target-language countries. We release our dataset as well as a set of strong baselines to encourage research on learning multilingual ToD systems for real use cases.