Country
Areca: Online Comparison of Research Results
Urbansky, David (Dresden University of Technology) | Muthmann, Klemens (Dresden University of Technology) | Kreisz, Lars (Dresden University of Technology) | Schill, Alexander (Dresden University of Technology)
To experiment properly, scientists from many researchareas need large sets of real world data. Information re-trieval scientists for example often need to evaluate theiralgorithms on a dataset or a gold standard. The availabil-ity of these datasets often is insuf๏ฌcient and authors withthe same goal do not evaluate their approaches on thesame data. To make research results more transparentand comparable, we introduce Areca, an online portalfor sharing datasets and/or the results that were reachedwith the authorโs algorithms on these datasets. Havingsuch an online comparison makes it easier to grasp thestate-of-the-art on certain tasks and drive research toimprove the results.
TweetTrader.net: Leveraging Crowd Wisdom in a Stock Microblogging Forum
Sprenger, Timm Oliver (Technische Universität München)
TweetTrader.net is a stock microblogging forum that leverages the wisdom of crowds to aggregate the information contained in stock-related tweets. Based on insights from academic research on stock microblogs, the application integrates inputs from text classification, user voting and a proprietary Stock Game in order to extract the sentiment (i.e., the bullishness) of online investors with respect to all publicly traded companies of the S&P 500.
Making Project Team Recommendations from Online Information Sources
Earl, Charles C. (Virkaz Technologies) | Johnson, Amos (Morehouse College) | Yelpaala, Kaakpema (Yelpaala Good Advisors) | Good, Travis (Yelpaala Good Advisors)
We are developing an Internet platform called MediaTeam that provides a marketplace connecting media content consumers to communities of media content creators. The platform is enabled by our method for automated assembly of virtual project teams. Media creators use the automated team assembler to quickly identify and team with collaborators. The team assembly platform factors in how the skills, work, and communication styles of team members complement each other into its team recommendation process. We are now testing the teaming and collaboration platforms with video creators and seek to launch by the summer.
Does Bad News Go Away Faster?
Wu, Shaomei (Cornell University) | Tan, Chenhao (Cornell University) | Kleinberg, Jon (Cornell University) | Macy, Michael Walton (Cornell University)
We study the relationship between content and temporal dynamics of information on Twitter, focusing on the persistence of information. We compare two extreme temporal patterns in the decay rate of URLs embedded in tweets, defining a prediction task to distinguish between URLs that fade rapidly following their peak of popularity and those that fade more slowly. Our experiments show a strong association between the content and the temporal dynamics of information: given unigram features extracted from corresponding HTML webpages, a linear SVM classifier can predict the temporal pattern of URLs with high accuracy. We further explore the content of URLs in the two temporal classes using various textual analysis techniques (via LIWC and trend detection). We find that the rapidly-fading information contains significantly more words related to negative emotion, actions, and more complicated cognitive processes, whereas the persistent information contains more words related to positive emotion, leisure, and lifestyle.
Personalized Landmark Recommendation Based on Geotags from Photo Sharing Sites
Shi, Yue (Delft University of Technology) | Serdyukov, Pavel (Yandex) | Hanjalic, Alan (Delft University of Technology) | Larson, Martha (Delft University of Technology)
Geotagged photos of users on social media sites provide abundant location-based data, which can be exploited for various location-based services, such as travel recommendation. In this paper, we propose a novel approach to a new application, i.e., personalized landmark recommendation based on usersโ geotagged photos. We formulate the landmark recommendation task as a collaborative filtering problem, for which we propose a category-regularized matrix factorization approach that integrates both user-landmark preference and category-based landmark similarity. We collected geotagged photos from Flickr and landmark categories from Wikipedia for our experiments. Our experimental results demonstrate that the proposed approach outperforms popularity-based landmark recommendation and a basic matrix factorization approach in recommending personalized landmarks that are less visited by the population as a whole.
Hierarchical Bayesian Models for Latent Attribute Detection in Social Media
Rao, Delip (Johns Hopkins University) | Paul, Michael (Johns Hopkins University) | Fink, Clay (Johns Hopkins University) | Yarowsky, David (Johns Hopkins University) | Oates, Timothy (University of Maryland Baltimore County) | Coppersmith, Glen (JHU Human Language Technology Center of Excellence)
We present several novel minimally-supervised models for detecting latent attributes of social media users, with a focus on ethnicity and gender. Previouswork on ethnicity detection has used coarse-grained widely separated classes of ethnicity and assumed the existence of large amounts of training data such as the US census, simplifying the problem. Instead, we examine content generated by users in addition to name morpho-phonemics to detect ethnicity and gender. Further, weaddress this problem in a challenging setting where the ethnicity classes are more fine grained -- ethnicity classes in Nigeria -- and with very limited training data.
โDancing with the Stars,โ NBA Games, Politics: An Exploration of Twitter Usersโ Response to Events
Popescu, Ana-Maria (Yahoo! Labs) | Pennacchiotti, Marco (Yahoo! Labs)
Microblogging services such as Twitter offer great opportunities for analyzing the reactions of a wide audience with respect to current events. In this paper, we explore the correlation between types of user engagement and events centered around celebrities (e.g., personal or professional events involving Actors, Musicians, Politicians, Athletes).
RT to Win! Predicting Message Propagation in Twitter
Petrovic, Sasa (University of Edinburgh) | Osborne, Miles (University of Edinburgh) | Lavrenko, Victor (University of Edinburgh)
Twitter is a very popular way for people to share information on a bewildering multitude of topics. Tweets are propagated using a variety of channels: by following users or lists, by searching or by retweeting. Of these vectors, retweeting is arguably the most effective, as it can potentially reach the most people, given its viral nature. A key task is predicting if a tweet will be retweeted, and solving this problem furthers our understanding of message propagation within large user communities. We carry out a human experiment on the task of deciding whether a tweet will be retweeted which shows that the task is possible, as human performance levels are much above chance. Using a machine learning approach based on the passive-aggressive algorithm, we are able to automatically predict retweets as well as humans. Analyzing the learned model, we find that performance is dominated by social features, but that tweet features add a substantial boost.
Connecting Mutually Influencing Bloggers
Pal, Aditya (University of Minnesota) | Kawale, Jaya (University of Minnesota)
The blogosphere shows the characteristics of a power law distribution where a small set of the bloggers (influentials) get the majority of readership and the vast majority receives little traffic. Blogger recommendation algorithms aim at finding influentials for recommendation, putting bloggers with limited readership at further disadvantage. These bloggers could benefit from mutual endorsement of each other with the eventual goal of forming strong local communities with broader readership. In this paper, we propose a recommendation algorithm to connect blogger pairs with the intent that once connected the bloggers would share a mutually influencing relationship between them. In particular, we compute bloggers' influence profile based on how much she influences her blog friends and recommend bloggers with similar influence profiles. We characterize bloggers into four different groups: global leaders, connectors, local leaders, isolates. Our result shows marginal benefit for isolates and significant benefit for local leaders. Our approach can be instructive in building intelligent recommendation engine for bloggers with limited readership to build strong local communities.
An Empirical Study of Geographic User Activity Patterns in Foursquare
Noulas, Anastasios (University of Cambridge) | Scellato, Salvatore (University of Cambridge) | Mascolo, Cecilia (University of Cambridge) | Pontil, Massimiliano (University College London)
We present a large-scale study of user behavior in Foursquare, conducted on a dataset of about 700 thousand users that spans a period of more than 100 days. We analyze user checkin dynamics, demonstrating how it reveals meaningful spatio-temporal patterns and offers the opportunity to study both user mobility and urban spaces. Our aim is to inform on how scientific researchers could utilise data generated in Location-based Social Networks to attain a deeper understanding of human mobility and how developers may take advantage of such systems to enhance applications such as recommender systems.