AITopics | Discourse & Dialogue

Collaborating Authors

Discourse & Dialogue

Understanding Language in Conversations "The problems addressed in discourse research aim to answer two general kinds of questions: (1) what information is contained in extended sequences of utterances that goes beyond the meaning of the individual utterances themselves? (2) how does the context in which an utterance is used affect the meaning of the individual utterances, or parts of them?"
– Barbara Grosz. Overview of Chapter 6: Discourse and Dialogue, Survey of the State of the Art in Human Language Technology (1996).

News Overviews Instructional Materials AI-Alerts Classics

How Important Is Size? An Investigation of Corpus Size and Meaning in Both Latent Semantic Analysis and Latent Dirichlet Allocation

Crossley, Scott (Georgia State University) | Dascalu, Mihai (University Politehnica of Bucharest) | McNamara, Danielle (Arizona State University)

AAAI ConferencesMay-16-2017

This study examines how differences in corpus size influence the accuracy of Latent Semantic Analysis (LSA) spaces and Latent Dirichlet Allocation (LDA) spaces in two tasks: a word association task and a vocabulary definition test. Specific optimizations were considered in building each semantic model. Initial results indicate that larger corpora lead to greater accuracy and that LDA probabilistic models, similar to LSA vector spaces, can provide insights into cognitive processing at semantic levels.

analysis and latent dirichlet allocation, investigation, latent semantic analysis, (1 more...)

AAAI Conferences

The Thirtieth International Flairs Conference

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.60)

Add feedback

An Efficient Deep Neural Architecture for Multilingual Sentiment Analysis in Twitter

Becker, Willian (Pontifícia Universidade Católica do Rio Grande do Sul) | Wehrmann, Jônatas (Pontifícia Universidade Católica do Rio Grande do Sul) | Cagnini, Henry E. L. (Pontifícia Universidade Católica do Rio Grande do Sul) | Barros, Rodrigo C. (Pontifícia Universidade Católica do Rio Grande do Sul)

AAAI ConferencesMay-16-2017

Sentiment analysis of tweets is often monolingual and the models provided by machine learning classifiers are usually not applicable across distinct languages. Cross-language sentiment classification usually relies on machine translation strategies in which a source language is translated to the desired target language. Machine translation is costly and the provided results are limited by the quality of the translation that is performed. In this paper, we propose an efficient translation-free deep neural architecture for performing multilingual sentiment analysis of tweets. Our proposed approach benefits from a cost-effective character-based embedding and from optimized convolutions to learn from multiple distinct languages. The resulting model is capable of learning latent features from all languages used during training at once and it does not require any translation process to be performed whatsoever. We empirically evaluate the efficiency and effectiveness of the proposed approach in tweet corpora from four different languages and we show that it presents the best trade-off among four distinct state-of-the-art deep neural architectures for sentiment analysis.

Add feedback

Predicting Movie Ratings: NLP Tools is What Film Studios Need

#artificialintelligenceMay-15-2017, 01:15:09 GMT

She writes about software development, UI and UX, natural language processing, Big Data, AI, and other IT-related topics.

artificial intelligence, natural language, text processing, (17 more...)

#artificialintelligence

Country: North America (0.05)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.70)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.51)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.51)

Add feedback

Sentiment Analysis & Predictive Analytics for trading. Avoid this systematic mistake

@machinelearnbotMay-13-2017, 16:20:06 GMT

Many common mistakes can be avoided when testing sentiment data for predictive properties. The term "prediction" is not a legal definition. In assessing the predictive qualities of sentiment data there are no rules for what counts as a signal to be tested for predictive properties with regard to financial assets. However, the method you chose ultimately defines what you mean with the term "prediction". To illustrate the point: Using a more prudent definition of the term, the accuracy in the world's most famous prediction study could have been as low as 47% (7 out of 15) instead of 87% (13 out of 15%). An accuracy rate of 47% would not have produced worldwide media attention and more than 1600 academic citations, in my view.

artificial intelligence, data mining, natural language, (17 more...)

@machinelearnbot

Industry: Banking & Finance > Trading (0.52)

Technology:

Information Technology > Data Science > Data Mining (0.78)
Information Technology > Communications > Social Media (0.67)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.40)

Add feedback

A Sentiment Analysis System to Improve Teaching and Learning

IEEE ComputerMay-10-2017, 21:40:10 GMT

Natural language processing and machine learning can be applied to student feedback to help university administrators and teachers address problematic areas in teaching and learning. The proposed system analyzes student comments from both course surveys and online sources to identify sentiment polarity, the emotions expressed, and satisfaction versus dissatisfaction. A comparison with direct-assessment results demonstrates the system's reliability.

artificial intelligence, natural language, sentiment analysis system, (2 more...)

IEEE Computer

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.58)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.58)

Add feedback

People on Media: Jointly Identifying Credible News and Trustworthy Citizen Journalists in Online Communities

Mukherjee, Subhabrata, Weikum, Gerhard

arXiv.org Machine LearningMay-9-2017

Media seems to have become more partisan, often providing a biased coverage of news catering to the interest of specific groups. It is therefore essential to identify credible information content that provides an objective narrative of an event. News communities such as digg, reddit, or newstrust offer recommendations, reviews, quality ratings, and further insights on journalistic works. However, there is a complex interaction between different factors in such online communities: fairness and style of reporting, language clarity and objectivity, topical perspectives (like political viewpoint), expertise and bias of community members, and more. This paper presents a model to systematically analyze the different interactions in a news community between users, news, and sources. We develop a probabilistic graphical model that leverages this joint interaction to identify 1) highly credible news articles, 2) trustworthy news sources, and 3) expert users who perform the role of "citizen journalists" in the community. Our method extends CRF models to incorporate real-valued ratings, as some communities have very fine-grained scales that cannot be easily discretized without losing information. To the best of our knowledge, this paper is the first full-fledged analysis of credibility, trust, and expertise in news communities.

machine learning, natural language, news article, (21 more...)

arXiv.org Machine Learning

doi: 10.1145/2806416.2806537

1705.02667

Country:

North America > United States (0.93)
Asia (0.67)

Genre: Research Report (1.00)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.68)

Add feedback

Credible Review Detection with Limited Information using Consistency Analysis

Mukherjee, Subhabrata, Dutta, Sourav, Weikum, Gerhard

arXiv.org Machine LearningMay-7-2017

Online reviews provide viewpoints on the strengths and shortcomings of products/services, influencing potential customers' purchasing decisions. However, the proliferation of non-credible reviews -- either fake (promoting/ demoting an item), incompetent (involving irrelevant aspects), or biased -- entails the problem of identifying credible reviews. Prior works involve classifiers harnessing rich information about items/users -- which might not be readily available in several domains -- that provide only limited interpretability as to why a review is deemed non-credible. This paper presents a novel approach to address the above issues. We utilize latent topic models leveraging review texts, item ratings, and timestamps to derive consistency features without relying on item/user histories, unavailable for "long-tail" items/users. We develop models, for computing review credibility scores to provide interpretable evidence for non-credible reviews, that are also transferable to other domains -- addressing the scarcity of labeled data. Experiments on real-world datasets demonstrate improvements over state-of-the-art baselines.

amazon, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

1705.02668

Genre: Research Report (1.00)

Industry: Consumer Products & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.88)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Add feedback

People on Drugs: Credibility of User Statements in Health Communities

Mukherjee, Subhabrata, Weikum, Gerhard, Danescu-Niculescu-Mizil, Cristian

arXiv.org Machine LearningMay-6-2017

Online health communities are a valuable source of information for patients and physicians. However, such user-generated resources are often plagued by inaccuracies and misinformation. In this work we propose a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources. To this end we introduce a probabilistic graphical model that jointly learns user trustworthiness, statement credibility, and language objectivity. We apply this methodology to the task of extracting rare or unknown side-effects of medical drugs --- this being one of the problems where large scale non-expert data has the potential to complement expert medical knowledge. We show that our method can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.

data mining, machine learning, natural language, (22 more...)

arXiv.org Machine Learning

1705.02522

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Consumer Health (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(5 more...)

Add feedback

Artificial Intelligence and Machine Learning Are Now Driving Marketing and Customer Engagement Activities

@machinelearnbotMay-5-2017, 12:45:44 GMT

As that "mention" gets pulled into the system, an AI called Natural Language Processing reads the post text and determines its "Sentiment." Sentiment is used to determine if the post is positive, neutral, or negative, (and in some advanced cases, the emotion like "anger" "sadness" or "joy"). Doing this manually for every post that comes in isn't feasible (we see tens of thousands of posts on any given week). AI does this automatically for us, and it can "learn" to improve its NLP Sentiment analysis as more posts pass through it, and as manual adjustments for errors are made. And speaking of Christian's post, he used the "#nofilter" hashtag which can be assigned a "proud" tag since its basically saying "my picture was so good, I didn't need to edit it."

artificial intelligence and machine learning, marketing and customer engagement activity, natural language, (2 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.31)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.31)

Add feedback

Filters

Collaborating Authors

Discourse & Dialogue

How Important Is Size? An Investigation of Corpus Size and Meaning in Both Latent Semantic Analysis and Latent Dirichlet Allocation

An Efficient Deep Neural Architecture for Multilingual Sentiment Analysis in Twitter

Predicting Movie Ratings: NLP Tools is What Film Studios Need

Sentiment Analysis & Predictive Analytics for trading. Avoid this systematic mistake

A Sentiment Analysis System to Improve Teaching and Learning

People on Media: Jointly Identifying Credible News and Trustworthy Citizen Journalists in Online Communities

Credible Review Detection with Limited Information using Consistency Analysis

People on Drugs: Credibility of User Statements in Health Communities

Artificial Intelligence and Machine Learning Are Now Driving Marketing and Customer Engagement Activities

Tutorial: Building a Twitter Sentiment Analysis Process