Goto

Collaborating Authors

 Discourse & Dialogue


Sentiment analysis with machine learning in R

#artificialintelligence

Machine learning makes sentiment analysis more convenient. This post would introduce how to do sentiment analysis with machine learning using R. In the landscape of R, the sentiment R package and the more general text mining package have been well developed by Timothy P. Jurka. You can check out the sentiment package and the fantastic RTextTools package. Actually, Timothy also writes an maxent package for low-memory multinomial logistic regression (also known as maximum entropy).


Sentiment analysis - A case study on Flipkart and Snapdeal on World Book Day - ParallelDots

#artificialintelligence

With the big data growing bigger and bigger and social media penetrating every facet of the society, construing and monitoring data is one of the biggest challenges faced by the enterprises. Gone are those days when customers have to lodge a formal complaint to register the malfunctioning of any product/services provided by the business enterprise, rather, users these days take it to the social media forum to express their dissatisfaction and anguish towards any improper services/products. Inputs such as tweets, facebook comments could be of significant value to the enterprise to analyze their products/services/ performances, customer behavior and demands. Below is a small case study on Flipkart and Snapdeal performance when'World Book Day' was trending on Twitter. Below is the screenshot of'Flipkart' and'Snapdeal' on the occasion of'World Book Day'.


SuperBowl XLIX in Tweets: Sentiment Analysis of 4 Million Tweets

@machinelearnbot

This blog was originally published on our Text Analysis blog, the blog post set out to analyze and visualize 4 million tweets collected during Superbowl XLIX. Not surprisingly, Superbowl XLIX generated a huge amount of chatter on social networks with Twitter Estimating that over 28.4 million posts made with terms relating to the Superbowl. At AYLIEN, we collected just under 4 million Tweets from Hashtags, Handles and Keywords we were monitoring. To keep our sample clean, we removed any reTweets and spam from the Tweets collected and only worked with those Tweets that were written in English. We were left with about 3.5 million Tweets to play with.



Learn Everything about Sentiment Analysis using R

#artificialintelligence

For our case we only consider Text feature of the Tweet as we are interested on the review of the movie. We can also use the other features such as Latitude/Longitude, replied to, etc. do other analysis on the tweeted data.


indico Named Boston's Best Tech Startup at 2nd Annual Timmy Awards

#artificialintelligence

About indico indico provides state-of-the-art machine learning algorithms for text and image analysis in the form of a simple to use web service. This, for the first time, enables companies to automatically extract meaningful insight from unstructured data regardless of their size or capability. Sentiment Analysis, Social Media Monitoring, Content Filtering, Content Classification, Recommendation, and Personalization are just some of the areas in which indico's customers are deploying its technology to improve business outcomes. Furthermore, indico's rapid customization capabilities have also enabled companies such as Mavrck, CO Everywhere, and interlinkONE to quickly develop compelling new solutions that weren't practical before.


Temporal Topic Analysis with Endogenous and Exogenous Processes

AAAI Conferences

We consider the problem of modeling temporal textual data taking endogenous and exogenous processes into account. Such text documents arise in real world applications, including job advertisements and economic news articles, which are influenced by the fluctuations of the general economy. We propose a hierarchical Bayesian topic model which imposes a "group-correlated" hierarchical structure on the evolution of topics over time incorporating both processes, and show that this model can be estimated from Markov chain Monte Carlo sampling methods. We further demonstrate that this model captures the intrinsic relationships between the topic distribution and the time-dependent factors, and compare its performance with latent Dirichlet allocation (LDA) and two other related models. The model is applied to two collections of documents to illustrate its empirical performance: online job advertisements from DirectEmployers Association and journalists' postings on BusinessInsider.com.


Gated Neural Networks for Targeted Sentiment Analysis

AAAI Conferences

Targeted sentiment analysis classifies the sentiment polarity towards each target entity mention in given text documents. Seminal methods extract manual discrete features from automatic syntactic parse trees in order to capture semantic information of the enclosing sentence with respect to a target entity mention. Recently, it has been shown that competitive accuracies can be achieved without using syntactic parsers, which can be highly inaccurate on noisy text such as tweets. This is achieved by applying distributed word representations and rich neural pooling functions over a simple and intuitive segmentation of tweets according to target entity mentions. In this paper, we extend this idea by proposing a sentence-level neural model to address the limitation of pooling functions, which do not explicitly model tweet-level semantics. First, a bi-directional gated neural network is used to connect the words in a tweet so that pooling functions can be applied over the hidden layer instead of words for better representing the target and its contexts. Second, a three-way gated neural network structure is used to model the interaction between the target mention and its surrounding contexts. Experiments show that our proposed model gives significantly higher accuracies compared to the current best method for targeted sentiment analysis.


Collective Supervision of Topic Models for Predicting Surveys with Social Media

AAAI Conferences

This paper considers survey prediction from social media. We use topic models to correlate social media messages with survey outcomes and to provide an interpretable representation of the data. Rather than rely on fully unsupervised topic models, we use existing aggregated survey data to inform the inferred topics, a class of topic model supervision referred to as collective supervision. We introduce and explore a variety of topic model variants and provide an empirical analysis, with conclusions of the most effective models for this task.


Natural Language Processing for Enhancing Teaching and Learning

AAAI Conferences

Advances in natural language processing (NLP) and educational technology, as well as the availability of unprecedented amounts of educationally-relevant text and speech data, have led to an increasing interest in using NLP to address the needs of teachers and students. Educational applications differ in many ways, however, from the types of applications for which NLP systems are typically developed. This paper will organize and give an overview of research in this area, focusing on opportunities as well as challenges.