Europe
Twitter Sentiment Analysis: The Good the Bad and the OMG!
Kouloumpis, Efthymios (i-sieve Technologies) | Wilson, Theresa (Johns Hopkins University) | Moore, Johanna (University of Edinburgh)
In this paper, we investigate the utility of linguistic features for detecting the sentiment of Twitter messages. We evaluate the usefulness of existing lexical resources as well as features that capture information about the informal and creative language used in microblogging. We take a supervied approach to the problem, but leverage existing hashtags in the Twitter data for building training data.
Structure and Reciprocity in Technology-Centered Q&A Communities
Jiang, Ming (University of Michigan) | Dong, Tao (University of Michigan) | Chang, Yung-Ju (University of Michigan)
In this paper we examine the network structure of the MythTV mailing list, an online technology Q&A user community, and we use time-series analysis techniques to study users’ reciprocity behavior in this community. We find that the amount of help users provide is strongly correlated to the amount of help they receive. Further, by conducting the Granger Causality test on the time series data of active users’ activity, we find that the amount of help given is actually the reason why one gets a lot of help. This finding corresponds to the concept of directed reciprocity in social networks and provides insights into social dynamics in technology-centered online communities.
Identifying Users Across Social Tagging Systems
Iofciu, Tereza (Leibniz University Hannover) | Fankhauser, Peter (Leibniz University Hannover) | Abel, Fabian (TU Delft) | Bischoff, Kerstin (Leibniz University Hannover)
How much do tagging activities tell about a user? Is it possible to identify people in Delicious based on the tags, which they use in Flickr? In this paper we study those questions and investigate whether users can be identified across social tagging systems. We combine two kinds of information: their user ids and their tags. We introduce and compare a variety of approaches to measure the distance between user profiles for identification. With the best performing combination we achieve, depending on the actual settings, accuracies of between 60% and 80% which demonstrates that the traces of Web 2.0 users can reveal quite much about their identity.
Characterizing Social Relations Via NLP-Based Sentiment Analysis
Groh, Georg (TU Muenchen) | Hauffa, Jan (TU Muenchen)
We investigate and evaluate methods for the characterization of social relations from textual communication context, using e-mail as an example. Social relations are intrinsically characterized by the Cartesian product of weights on various axes (we employ valuation and intensity as examples). The prediction of these characteristics is performed by application of unsupervised learning algorithms on meta-data, communication statistics, and the results of deep linguistic analysis of the message body. Classification of sentiment polarity is chosen as the means of linguistic analysis. We find that prediction accuracy can be improved by introducing limited amounts of additional information.
Automatically Identifying Groups Based on Content and Collective Behavioral Patterns of Group Members
Gregory, Michelle (Pacific Northwest National Laboratory) | Engel, Dave W. (Pacific Northwest National Laboratory) | Bell, Eric (Pacific Northwest National Laboratory) | Piatt, Andy (Pacific Northwest National Laboratory) | Dowson, Scott (Pacific Northwest National Laboratory) | Cowell, Andrew (Pacific Northwest National Laboratory)
For example, on Live Journal1, there are a number of categories, gaming, for The explosion of popularity in social media, such as internet example, that one can categorize themselves and their forums, weblogs (blogs), wikis, etc., in the past decade blogs. While a number of those that self select that category has created a new opportunity to measure public opinion, may interact, there is no explicit requirement to do so. If attitude, and social structures (Agichtein et al. 2008, one is interested in marketing to a gaming crowd, for instance, Qualman 2010). A very common social structure investigated knowing all persons interested in gaming would be is online communities, or groups. There are a number useful, even if they do not interact directly with one another.
Limits of Electoral Predictions Using Twitter
Gayo-Avello, Daniel (Universidad de Oviedo) | Metaxas, Panagiotis Takis (Wellesley College) | Mustafaraj, Eni (Wellesley College)
Using social media for political discourse is becoming common practice, especially around election time. One interesting aspect of this trend is the possibility of pulsing the public’s opinion about the elections, and that has attracted the interest of many researchers and the press. Allegedly, predicting electoral outcomes from social media data can be feasible and even simple. Positive results have been reported, but without an analysis on what principle enables them. Our work puts to test the purported predictive power of socialmedia metrics against the 2010 US congressional elections. Here, we applied techniques that had reportedly led to positive election predictions in the past, on the Twitter data collected from the 2010 US congressional elections. Unfortunately, we find no correlation between the analysis results and the electoral outcomes, contradicting previous reports. Observing that 80 years of polling research would support our findings, we argue that one should not be accepting predictions about events using social media data as a black box. Instead, scholarly research should be accompanied by a model explaining the predictive power of social media, when there is one.
Automatic Group-Interactive Radio Using Social-Networks of Musicians
Fields, Ben (University of London) | Rhodes, Christophe (University of London) | d' (University of London) | Inverno, Mark
Using request radio shows as a base interactive model, we present the Steerable Optimizing Self-Organized Radio (SoSoRadio) system as a prototypical music rec- ommender system with robust automatic playlist gen- eration. This work describes a web-based radio system that interacts with current listeners through the selection of periodic request songs from a pool of nominees.
Using the H-Index to Estimate Blog Authority
Devezas, José (Labs SAPO/UP) | Nunes, Sérgio (Instituto de Engenharia de Sistemas e Computadores do Porto, Universidade do Porto) | Ribeiro, Cristina (Instituto de Engenharia de Sistemas e Computadores do Porto)
Link analysis is a technique frequently used in the ranking of web sites. On the web, we often encounter content that is organized by entries, sorted from recent to old, and generally follows the structure of a blog. In this paper we explore and evaluate the usage of a bibliometrics measure, called h-index, for the task of blog ranking, in an information retrieval context. We base our experiments on the TREC Blogs08 collection, which comprises over 28 million posts. The results obtained indicate that the h-index is a robust metric that allows for an improved relevance discrimination between blogs, when compared to the in-degree. Additionally, tests performed using distinct versions of the post graph, indicate that this metric might tolerate a certain level of link clutter.
Analyzing Political Trends in the Blogosphere
Demartini, Gianluca (L3S Research Center) | Siersdorfer, Stefan (L3S Research Center) | Chelaru, Sergiu (L3S Research Center) | Nejdl, Wolfgang (L3S Research Center)
In the last years, the blogosphere has become a vital part of the web, covering a variety of different points of view and opinions on political and event-related topics such as immigration, election campaigns, or economic developments. Tracking the public opinion is usually done by conducting surveys resulting in significant costs both for interviewers and persons consulted. In this paper, we propose a method for extracting political trends in the blogosphere.To this end, we apply sentiment and time series analysis techniques in combination with aggregation methods on blog data to estimate the temporal development of opinions on politicians.
Improving Text Clustering with Social Tagging
Ares, M. Eduardo (University of A Coruña) | Parapar, Javier (University of A Coruña) | Barreiro, Álvaro (University of A Coruña)
Another important question is the absoluteness of the constraints. Lately several web-based tagging systems such as Technorati, Even if we use this approach to turn tags into constraints, Flickr or Delicious have become very popular. In this a fair amount of them are bound to be inaccurate paper we will exploit the information created by the community (i.e., linking documents which should not be in the same in Delicious: a social bookmarking service where cluster) until a high value of the parameter t, due to the polysemy the users can save the URLs of their favourite webpages of the terms used as tags or to differences in the criteria offering also the possibility of associating tags to them. of the taggers. Consequently, we have used soft positive On the other hand the clustering methods are a very important constraints, meaning that the documents affected by one of data mining tool in order to exploit the knowledge them are likely to be in the same cluster, without forcing the present in data collections. In the last years a new family of clustering algorithm to actually put them so.