Goto

Collaborating Authors

 Industry


Rating Friends Without Making Enemies

AAAI Conferences

As online social networks expand their role beyond maintaining existing relationships, they may look to more faceted ratings to support the formation of new connections between their users. Our study focuses on one community employing faceted ratings, CouchSurfing.org, and combines data analysis of ratings, a large-scale survey, and in-depth interviews. In order to understand the ratings, we revisit the notions of friendship and trust and uncover an asymmetry: close friendship includes trust, but high levels of trust can be achieved without close friendship. To users, providing faceted ratings presents challenges, including differentiating and quantifying inherently subjective feelings such as friendship and trust, concern over a friend's reaction to a rating, and knowledge of how ratings can affect others' reputations. One consequence of these issues is the near absence of negative feedback, even though a small portion of actual experiences and privately held ratings are negative. We show how users take this into account when formulating and interpreting ratings, and discuss designs that could encourage more balanced feedback.


Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena

AAAI Conferences

We perform a sentiment analysis of all tweets published on the microblogging platform Twitter in the second half of 2008. We use a psychometric instrument to extract six mood states (tension, depression, anger, vigor, fatigue, confusion) from the aggregated Twitter content and compute a six-dimensional mood vector for each day in the timeline. We compare our results to a record of popular events gathered from media and sources. We find that events in the social, political, cultural and economic sphere do have a significant, immediate and highly specific effect on the various dimensions of public mood. We speculate that large scale analyses of mood can provide a solid platform to model collective emotive trends in terms of their predictive value with regards to existing social as well as economic indicators.


Exploiting Semantic Annotations for Clustering Geographic Areas and Users in Location-based Social Networks

AAAI Conferences

Location-Based Social Networks (LBSN) present so far the most vivid realization of the convergence of the physical and virtual social planes. In this work we propose a novel approach on modeling human activity and geographical areas by means of place categories. We apply a spectral clustering algorithm on areas and users of two metropolitan cities on a dataset sourced from the most vibrant LBSN, Foursquare. Our methodology allows the identification of user communities that visit similar categories of places and the comparison of urban neighborhoods within and across cities. We demonstrate how semantic information attached to places could be plausibly used as a modeling interface for applications such as recommender systems and digital tourist guides.


Asked and Answered: On Qualities and Quantities of Answers in Online Q&A Sites

AAAI Conferences

This paper builds upon several recent research efforts that have explored the nature and qualities of questions asked on these social Q&A sites by offering a focused examination of answers posted to three of the most popular Q&A sites. Specifically, this paper examines sets of answers responding to specific types of questions and explores the degree to which question types are predictive of answer quantity and answer quality. Blending qualitative and quantitative methods, the paper builds upon rich coding of a representative sets of real questions — drawn from Answerbag, (Ask) MetaFilter, and Yahoo! Answers — in order to better understand whether the explicit and implicit theories and predictions drawn from coding of these questions were borne out in the corresponding answer sets found on these sites. Quantitative findings include data underscoring the general overall success of social Q&A sites in producing answers that can satisfy the needs of those who pose questions. Additionally, this paper presents a predictive model that can anticipate the archival value of answers based on the category and qualities of questions asked. Qualitative findings include an analysis of the variation in responses to questions that are primarily seeking objective, grounded information relative to those seeking subjective opinions.


Modeling the Detection of Textual Cyberbullying

AAAI Conferences

The scourge of cyberbullying has assumed alarming proportions with an ever-increasing number of adolescents admitting to having dealt with it either as a victim or as a bystander. Anonymity and the lack of meaningful supervision in the electronic medium are two factors that have exacerbated this social menace. Comments or posts involving sensitive topics that are personal to an individual are more likely to be internalized by a victim, often resulting in tragic outcomes. We decompose the overall detection problem into detection of sensitive topics, lending itself into text classification sub-problems. We experiment with a corpus of 4500 YouTube comments, applying a range of binary and multiclass classifiers. We find that binary classifiers for individual labels outperform multiclass classifiers. Our findings show that the detection of textual cyberbullying can be tackled by building individual topic-sensitive classifiers.


Information Markets for Social Participation in Public Policy Design and Implementation

AAAI Conferences

In this paper we propose a research agenda on the use of information markets as tools to collect, aggregate and analyze citizens’ opinions, expectations and preferences from social media in order to support public policy design and implementation. We argue that markets are institutional settings able to efficiently allocate scarce resources, aggregate and disseminate information into prices and accommodate hedging against various types of risks. We discuss various types of information markets, as well as address the participation of both human and computational agents in such markets.


GlobalIdentifier: Unexpected Personal Social Content with Data on the Web

AAAI Conferences

The past year has seen a growing public awareness of the privacy risks of social networking through personal information that people voluntarily disclose. A spotlight has accordingly been turned on the disclosure policies of social networking sites and on mechanisms for restricting access to personal information on Facebook and other sites. But this is not sufficient to address privacy concerns in a world where Web-based data mining tools can let anyone infer information about others by combining data from multiple sources. To illustrate this, we are building a demonstration data miner, GlobalInferencer, that makes inferences about an individual?s lifestyle and other behavior. GlobalInferencer uses linked data technology to perform unified searches across Facebook, Flickr, and public data sites. It demonstrates that controlling access to personal information on individual social networking sites is not an adequate framework for protecting privacy, or even for supporting valid inferencing. In addition to access restrictions, there must be mechanisms for maintaining the provenance of information combined from multiple sources, for revealing the context within which information is presented, and for respecting the accountability that determines how information should be used.


TweetTrader.net: Leveraging Crowd Wisdom in a Stock Microblogging Forum

AAAI Conferences

TweetTrader.net is a stock microblogging forum that leverages the wisdom of crowds to aggregate the information contained in stock-related tweets. Based on insights from academic research on stock microblogs, the application integrates inputs from text classification, user voting and a proprietary Stock Game in order to extract the sentiment (i.e., the bullishness) of online investors with respect to all publicly traded companies of the S&P 500.


Does Bad News Go Away Faster?

AAAI Conferences

We study the relationship between content and temporal dynamics of information on Twitter, focusing on the persistence of information. We compare two extreme temporal patterns in the decay rate of URLs embedded in tweets, defining a prediction task to distinguish between URLs that fade rapidly following their peak of popularity and those that fade more slowly. Our experiments show a strong association between the content and the temporal dynamics of information: given unigram features extracted from corresponding HTML webpages, a linear SVM classifier can predict the temporal pattern of URLs with high accuracy. We further explore the content of URLs in the two temporal classes using various textual analysis techniques (via LIWC and trend detection). We find that the rapidly-fading information contains significantly more words related to negative emotion, actions, and more complicated cognitive processes, whereas the persistent information contains more words related to positive emotion, leisure, and lifestyle.


Personalized Landmark Recommendation Based on Geotags from Photo Sharing Sites

AAAI Conferences

Geotagged photos of users on social media sites provide abundant location-based data, which can be exploited for various location-based services, such as travel recommendation. In this paper, we propose a novel approach to a new application, i.e., personalized landmark recommendation based on users’ geotagged photos. We formulate the landmark recommendation task as a collaborative filtering problem, for which we propose a category-regularized matrix factorization approach that integrates both user-landmark preference and category-based landmark similarity. We collected geotagged photos from Flickr and landmark categories from Wikipedia for our experiments. Our experimental results demonstrate that the proposed approach outperforms popularity-based landmark recommendation and a basic matrix factorization approach in recommending personalized landmarks that are less visited by the population as a whole.