Discourse & Dialogue
Sales forecasts: how to improve accuracy while simplifying models?
Identify the top four metrics that drive sales among the metrics that I have suggested in this article (by all means, please do not ignore external data sources - including a sentiment analysis index by product, that is, what your customers write about your products on Twitter), and create a simple regression model. You could get it done with Excel (use the data analysis plug-in or the linest functions) and get better forecasts than using a much more sophisticated model based only on internal data coming from just one of your many silos. Get confidence intervals for your sales forecasts: more about this in a few days; I will provide a very simple, model-free, data-driven solution to compute confidence intervals.
The importance of Neutral Class in Sentiment Analysis
Sentiment Analysis (detecting document's polarity, subjectivity and emotional states) is a difficult problem and several times I bumped into unexpected and interesting results. One of the strangest things that I found is that despite the fact that neutral class can improve under specific conditions the classification accuracy, it is often ignored by most researchers.
Global Bigdata Conference
As it turns out, other techniques including website path analysis, text analysis of customer feedback, sentiment analysis of social media, and graph analysis --all distinctly different analytics techniques with each delivering insights complementing the others--revealed a fuller picture: people weren't complaining about price, preferring the cheaper item, or any of the things that the retailer expected. Instead, customers were complaining about how hard it was to find designer jeans on the website. It was a website navigation issue. And the issue was invisible until the retailer made sense of analytics from a variety of sources.
Semantic Properties of Customer Sentiment in Tweets
An increasing number of people are using online social networking services (SNSs), and a significant amount of information related to experiences in consumption is shared in this new media form. Text mining is an emerging technique for mining useful information from the web. We aim at discovering in particular tweets semantic patterns in consumers' discussions on social media. Specifically, the purposes of this study are twofold: 1) finding similarity and dissimilarity between two sets of textual documents that include consumers' sentiment polarities, two forms of positive vs. negative opinions and 2) driving actual content from the textual data that has a semantic trend. The considered tweets include consumers opinions on US retail companies (e.g., Amazon, Walmart). Cosine similarity and K-means clustering methods are used to achieve the former goal, and Latent Dirichlet Allocation (LDA), a popular topic modeling algorithm, is used for the latter purpose. This is the first study which discover semantic properties of textual data in consumption context beyond sentiment analysis. In addition to major findings, we apply LDA (Latent Dirichlet Allocations) to the same data and drew latent topics that represent consumers' positive opinions and negative opinions on social media.
How to Transform your Google Spreadsheet Into an Opinion Mining Tool
This blog was originally featured on blog.aylien.com, a Text Analysis blog with tutorials, Data Visualisations and industry discussions. Our founder, Parsa Ghaffari, gave a talk recently on Natural Language Processing and Sentiment Analysis at the Science Gallery in Dublin. As part of the talk, he put together a nice little example of how you can transform your Google Spreadsheet into a powerful Text Analysis and Data Mining tool. In this case, he took a simple example of analyzing restaurant reviews from a popular review site but the same could be done for hotels, products, service offerings and so on. He wanted to show how easy it can be for data geeks and even the less technical marketers among us, to start analyzing text and gathering business insight from the reams of textual data online today.
Tool for Computing Continuous Distributed Representations of Words
Natural language processing (NLP) involves machine learning, artificial intelligence, algorithms and linguistics related to interactions between computers and human languages. One important goal of NLP is to design and build software that will understand and analyze human languages to simplify and optimize human - computer communication. NLP algorithms are usually based on probability theory and machine learning grounded in statistical inference -- to automatically learn rules through analysis of real-world usage. It includes word and sentence tokenization, text classification and sentiment analysis, spelling correction, information extraction, parsing, meaning extraction, question answering and requires both syntactic and semantic analysis at various levels. NLP applications today involve spelling and grammar correction in word processors, machine translation, sentiment analysis and email spam detection.
Latent Dirichlet Allocation Using Gibbs Sampling
Text clustering is a widely used techniques to automatically draw out patterns from a set of documents. This notion can be extended to customer segmentation in the digital marketing field. As one of its main core is to understand what drives visitors to come, leave and behave on site. One simple way to do this is by reviewing words that they used to arrive on site and what words they used ( what things they searched) once they're on your site. Another usage of text clustering is for document organization or indexing (tagging).
Global Brain That Makes You Think Twice
Rzepka, Rafal (Hokkaido University) | Mazur, Michal (Hokkaido University) | Clapp, Austin (Stanford University) | Araki, Kenji (Hokkaido University)
In this position paper we introduce our approach to positive computing by developing and integrating methods for future assistant and companion agents which could help us a) avoid making mistakes due to biases caused by insufficient knowledge, b) be more empathic and righteous, c) be more sensitive and thoughtful. We present text processing techniques for automatic discovery of possible reasoning errors and provide hints to make users doubt their beliefs when there is a possibility of harm. We present existing sources and methods, discuss on how natural language processing technologies could contribute to various aspects of well-being by giving examples of systems we develop, and describe the strengths and weaknesses of our approach.