AITopics

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.58)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.58)

Walker, Marilyn, Harmon, Colin, Graupera, James, Harrison, Davan, Whittaker, Steve

Modeling Performance in Open-Domain Dialogue with PARADISE

arXiv.org Artificial IntelligenceOct-21-2021

There has recently been an explosion of work on spoken dialogue systems, along with an increased interest in open-domain systems that engage in casual conversations on popular topics such as movies, books and music. These systems aim to socially engage, entertain, and even empathize with their users. Since the achievement of such social goals is hard to measure, recent research has used dialogue length or human ratings as evaluation metrics, and developed methods for automatically calculating novel metrics, such as coherence, consistency, relevance and engagement. Here we develop a PARADISE model for predicting the performance of Athena, a dialogue system that has participated in thousands of conversations with real users, while competing as a finalist in the Alexa Prize. We use both user ratings and dialogue length as metrics for dialogue quality, and experiment with predicting these metrics using automatic features that are both system dependent and independent. Our goal is to learn a general objective function that can be used to optimize the dialogue choices of any Alexa Prize system in real time and evaluate its performance. Our best model for predicting user ratings gets an R$^2$ of .136 with a DistilBert model, and the best model for predicting length with system independent features gets an R$^2$ of .865, suggesting that conversation length may be a more reliable measure for automatic training of dialogue systems.

conversation length, dialogue system, walker, (15 more...)

2110.11164

Country: North America > United States > California > Santa Cruz County > Santa Cruz (0.14)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.93)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
(2 more...)

#artificialintelligenceOct-20-2021, 23:06:30 GMT

Sentiment Analysis with Scikit-learn and GCP

For this project, I wanted to design a model that would do a simple classification of whether a phrase is positive or negative. Since I'm only looking for a binary result, I chose to use Sklearn's logistic regression module. If you were trying to predict more than two labels, you would have to use a different ML model. The data used is a corpus of 5,000 movie reviews -- 2,500 positive and 2,500 negative. The model has an accuracy of 90% and probably performs better with text that is similar to a review because it would more like the training data.

scikit-learn and gcp, training data, vectorizer, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.42)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.42)

arXiv.org Artificial IntelligenceOct-20-2021

Distributionally Robust Classifiers in Sentiment Analysis

Li, Shilun, Li, Renee, Zhang, Carina

In this paper, we propose sentiment classification models based on BERT integrated with DRO (Distributionally Robust Classifiers) to improve model performance on datasets with distributional shifts. We added 2-Layer Bi-LSTM, projection layer (onto simplex or Lp ball), and linear layer on top of BERT to achieve distributionally robustness. We considered one form of distributional shift (from IMDb dataset to Rotten Tomatoes dataset). We have confirmed through experiments that our DRO model does improve performance on our test set with distributional shift from the training set.

dataset, distributional shift, test accuracy, (12 more...)

2110.10372

Country: North America > United States > California > Santa Clara County > Palo Alto (0.05)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.86)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)

arXiv.org Machine LearningOct-20-2021

The R package sentometrics to compute, aggregate and predict with textual sentiment

Ardia, David, Bluteau, Keven, Borms, Samuel, Boudt, Kris

We provide a hands-on introduction to optimized textual sentiment indexation using the R package sentometrics. Textual sentiment analysis is increasingly used to unlock the potential information value of textual data. The sentometrics package implements an intuitive framework to efficiently compute sentiment scores of numerous texts, to aggregate the scores into multiple time series, and to use these time series to predict other variables. The workflow of the package is illustrated with a built-in corpus of news articles from two major U.S. journals to forecast the CBOE Volatility Index.

corpus, lexicon, sentiment, (16 more...)

arXiv.org Machine Learning

doi: 10.18637/jss.v099.i02

2110.10817

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > Quebec > Montreal (0.04)
Europe > Belgium (0.04)
(7 more...)

Genre: Workflow (0.89)

Industry:

Banking & Finance > Trading (1.00)
Banking & Finance > Economy (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.66)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.48)

#artificialintelligenceOct-18-2021, 19:15:31 GMT

Introducing Myself

I decided to sign up to Medium "by the other side" with the aim to publish my AI for Finance projects and empower my knowledge in these sectors, thanks to this great community! I love to analyse stocks and alternative assets prices and making inference using regressions, ensemble methods and sentiment analysis. I may still be not so capable of using Medium but I'll give it a shot! I'm so excited to start this!

Technology:

Information Technology > Communications > Social Media (0.34)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.34)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.34)

arXiv.org Artificial IntelligenceOct-16-2021

Improving Compositional Generalization with Self-Training for Data-to-Text Generation

Mehta, Sanket Vaibhav, Rao, Jinfeng, Tay, Yi, Kale, Mihir, Parikh, Ankur, Zhong, Hongtao, Strubell, Emma

Data-to-text generation focuses on generating fluent natural language responses from structured semantic representations. Such representations are compositional, allowing for the combination of atomic meaning schemata in various ways to express the rich semantics in natural language. Recently, pretrained language models (LMs) have achieved impressive results on data-to-text tasks, though it remains unclear the extent to which these LMs generalize to new semantic representations. In this work, we systematically study the compositional generalization of current state-of-the-art generation models in data-to-text tasks. By simulating structural shifts in the compositional Weather dataset, we show that T5 models fail to generalize to unseen structures. Next, we show that template-based input representations greatly improve the model performance and model scale does not trivially solve the lack of generalization. To further improve the model's performance, we propose an approach based on self-training using finetuned BLEURT for pseudo-response selection. Extensive experiments on the few-shot Weather and multi-domain SGD datasets demonstrate strong gains of our method.

dataset, generalization, representation, (15 more...)

2110.08467

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.47)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.46)

#artificialintelligenceOct-15-2021, 01:10:47 GMT

Sentimental Analysis in Machine Learning

Sentimental Analysis helps in quickly analyzing the numerous amount of data. Since Artificial Intelligence and its advanced technologies have started influencing different sectors. A lot of research work is taking place for developing different tools that can evolve Artificial Intelligence and Machine Learning more stronger. And Sentiment Analysis is one such topic that has created a buzz in the field of scientific and market research in the field of Natural Language Processing and Machine Learning with the help of its amazing applications. Basically, Sentiment Analysis is a Machine Learning tool.

algorithm, machine learning, sentimental analysis, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.52)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.52)

Madotto, Andrea, Lin, Zhaojiang, Winata, Genta Indra, Fung, Pascale

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

arXiv.org Artificial IntelligenceOct-15-2021

Learning to converse using only a few examples is a great challenge in conversational AI. The current best conversational models, which are either good chit-chatters (e.g., BlenderBot) or goal-oriented systems (e.g., MinTL), are language models (LMs) fine-tuned on large conversational datasets. Training these models is expensive, both in terms of computational resources and time, and it is hard to keep them up to date with new conversational skills. A simple yet unexplored solution is prompt-based few-shot learning (Brown et al. 2020) which does not require gradient-based fine-tuning but instead uses a few examples in the LM context as the only source of learning. In this paper, we explore prompt-based few-shot learning in dialogue tasks. We benchmark LMs of different sizes in nine response generation tasks, which include four knowledge-grounded tasks, a task-oriented generations task, three open-chat tasks, and controlled stylistic generation, and five conversational parsing tasks, which include dialogue state tracking, graph path generation, persona information extraction, document retrieval, and internet query generation. The current largest released LM (GPT-J-6B) using prompt-based few-shot learning, and thus requiring no training, achieves competitive performance to fully trained state-of-the-art models. Moreover, we propose a novel prompt-based few-shot classifier, that also does not require any fine-tuning, to select the most appropriate prompt given a dialogue history. Finally, by combining the power of prompt-based few-shot learning and a Skill Selector, we create an end-to-end chatbot named the Few-Shot Bot (FSB), which automatically selects the most appropriate conversational skill, queries different knowledge bases or the internet, and uses the retrieved knowledge to generate a human-like response, all using only few dialogue examples per skill.

arxiv preprint arxiv, dataset, dialogue, (9 more...)

2110.08118

Country:

North America > United States > New York (0.04)
North America > United States > Ohio (0.04)
North America > United States > District of Columbia > Washington (0.04)
(4 more...)

Genre: Research Report > Promising Solution (0.34)

Industry:

Retail (0.93)
Leisure & Entertainment (0.67)
Transportation > Ground (0.46)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceOct-14-2021

GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems

Ding, Bosheng, Hu, Junjie, Bing, Lidong, Aljunied, Sharifah Mahani, Joty, Shafiq, Si, Luo, Miao, Chunyan

Much recent progress in task-oriented dialogue (ToD) systems has been driven by available annotation data across multiple domains for training. Over the last few years, there has been a move towards data curation for multilingual ToD systems that are applicable to serve people speaking different languages. However, existing multilingual ToD datasets either have a limited coverage of languages due to the high cost of data curation, or ignore the fact that dialogue entities barely exist in countries speaking these languages. To tackle these limitations, we introduce a novel data curation method that generates GlobalWoZ -- a large-scale multilingual ToD dataset globalized from an English ToD dataset for three unexplored use cases. Our method is based on translating dialogue templates and filling them with local entities in the target-language countries. We release our dataset as well as a set of strong baselines to encourage research on learning multilingual ToD systems for real use cases.

target language, test data, use case, (16 more...)

2110.07679

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Singapore (0.04)
Europe > Germany > Saarland > Saarbrücken (0.04)
(20 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech (0.93)
(2 more...)