Goto

Collaborating Authors

 Country


Toward Social Causality: An Analysis of Interpersonal Relationships in Online Blogs and Forums

AAAI Conferences

In this paper we present encouraging preliminary results into the problem of social causality (causal reasoning used by intelligent agents in a social environment) in online social interactions based on a model of reciprocity. At every level, social relationships are guided by the shared understanding that most actions call for appropriate reactions, and that inappropriate reactions require management. Thus, we present an analysis of interpersonal relationships in English reciprocal contexts. Specifically, we rely here on a large and recently built database of 10,882 reciprocal relation instances in online media. The resource is analyzed along a set of novel and important dimensions: symmetry, affective value, gender}, and {\em intentionality of action which are highly interconnected. At a larger level, we automatically generate {\em chains of causal relations} between verbs indicating interpersonal relationships. Statistics along these dimensions give insights into people's behavior, judgments, and thus their social interactions.


A Comparison of Generated Wikipedia Profiles Using Social Labeling and Automatic Keyword Extraction

AAAI Conferences

In many collaborative systems, researchers are interested in creating representative user profiles. In this paper, we are particularly interested in using social labeling and automatic keyword extraction techniques for generating user profiles. Social labeling is a process in which users manually tag other users with keywords. Automatic keyword extraction is a technique that selects the most salient words to represent a user’s contribution. We apply each of these two profile generation methods to highly active Wikipedia editors and their contributions, and compare the results. We found that profiles generated through social labeling matches the profiles generated via automatic keyword extraction, and vice versa. The results suggest that user profiles generated from one method can be used as a seed or bootstrapping proxy for the other method.


“How Incredibly Awesome!” — Click Here to Read More

AAAI Conferences

We investigate the impact of a discussion snippet's overall sentiment on a user's willingness to read more of a discussion. Using sentiment analysis, we constructed positive, neutral, and negative discussion snippets using the discussion topic and a sample comment from discussions taking place around content on an enterprise social networking site. We computed personalized snippet recommendations for a subset of users and conducted a survey to test how these recommendations were perceived. Our experimental results show that snippets with high sentiments are better discussion "teasers."


The Perceived Credibility of Online Encyclopedias Among Children

AAAI Conferences

This study examined young people’s trust of Wikipedia as an information resource. A large-scale probability-based survey with embedded quasi-experiments was conducted with 2,747 children in the U.S. ranging from 11 to 18 years old. Results show that young people find Wikipedia to be fairly credible, but also exhibit an awareness of potential problems with non-expert, user-generated content in anonymous environments. Children tend to evaluate the credibility of online encyclopedia information with this in mind, at times with what appears to be an unwarranted devaluation of this information.


Characterizing Microblogs with Topic Models

AAAI Conferences

As microblogging grows in popularity, services like Twitter are coming to support information gathering needs above and beyond their traditional roles as social networks. But most users’ interaction with Twitter is still primarily focused on their social graphs, forcing the often inappropriate conflation of “people I follow” with “stuff I want to read.” We characterize some information needs that the current Twitter interface fails to support, and argue for better representations of content for solving these challenges. We present a scalable implementation of a partially supervised learning model (Labeled LDA) that maps the content of the Twitter feed into dimensions. These dimensions correspond roughly to substance, style, status, and social characteristics of posts. We characterize users and tweets using this model, and present results on two information consumption oriented tasks.


The Wisdom of Bookies? Sentiment Analysis Versus. the NFL Point Spread

AAAI Conferences

The American Football betting market provides a particularly attractive domain to study the nexus between public sentiment and the wisdom of crowds. In this paper, we present the first substantial study of the relationship between the NFL betting line and public opinion expressed in blogs and microblogs (Twitter). We perform a large-scale study of four distinct text streams: LiveJournal blogs, RSS blog feeds captured by Spinn3r, Twitter, and traditional news media. Our results show interesting disparities between the first and second halves of each season. We present evidence showing usefulness of sentiment on NFL betting. We demonstrate that a strategy betting roughly 30 games per year identified winner roughly 60% of the time from 2006 to 2009, well beyond what is needed to overcome the bookie's typical commission(53%).


Star Quality: Aggregating Reviews to Rank Products and Merchants

AAAI Conferences

Given a set of reviews of products or merchants from a wide range of authors and several reviews websites, how can we measure the true quality of the product or merchant?  How do we remove the bias of individual authors or sources?  How do we compare reviews obtained from different websites, where ratings may be on different scales (1-5 stars, A/B/C, etc.)?  How do we filter out unreliable reviews to use only the ones with ``star quality''?  Taking into account these considerations, we analyze data sets from a variety of different reviews sites (the first paper, to our knowledge, to do this). These data sets include 8 million product reviews and 1.5 million merchant reviews. We explore statistic- and heuristic- based models for estimating the true quality of a product or merchant, and compare the performance of these estimators on the task of ranking pairs of objects.  We also apply the same models to the task of using Netflix ratings data to rank pairs of movies, and discover that the performance of the different models is surprisingly similar on this data set.


Reports of the AAAI 2009 Fall Symposia

AI Magazine

Series, held Thursday through Saturday, November 5-7, at he Association for the Advancement of Artificial Intelligence the Westin Arlington Gateway in Arlington, Virginia. The titles of the seven symposia were as follows: (1) Biologically Inspired Cognitive Biologically Inspired Cognitive Architectures Architectures, (2) Cognitive and Metacognitive Cognitive and Metacognitive Educational Systems Educational Systems, (3) Complex Adaptive Complex Adaptive Systems and the Threshold Effect: Views from the Natural Systems and the Threshold Effect: Views and Social Sciences from the Natural and Social Sciences, (4) Manifold Manifold Learning and Its Applications Learning and Its Applications, (5) Multirepresentational Architectures for Human-Level Multirepresentational Architectures for Human-Level Intelligence Intelligence, (6) The Uses of Computational The Uses of Computational Argumentation Argumentation, and (7) Virtual Healthcare Virtual Healthcare Interaction Interaction. An informal reception was held on Thursday, November 5. A general plenary session, in which the highlights of each symposium were presented, was held on Friday, November 6. The challenge of creating a real-life computational equivalent of the human mind requires that we better understand at a computational level how natural intelligent systems develop their cognitive and learning functions. They will behave, variety of disjoined communities and schools of learn, communicate, and "think" as conscious thought that used to speak different languages and beings in general, in addition to being able to perform ignore each other.



An Integrated Modeling Environment to Study the Co-evolution of Networks, Individual Behavior and Epidemics

AI Magazine

We discuss an interaction-based approach to study the coevolution between socio-technical networks, individual behaviors, and contagion processes on these networks. We use epidemics in human population as an example of this phenomenon. The methods consist of developing synthetic yet realistic national-scale networks using a first principles approach. Unlike simple random graph techniques, these methods combine real world data sources with behavioral and social theories to synthesize detailed social contact (proximity) networks. Individual-based models of within-host disease progression and inter-host transmission are then used to model the contagion process. Finally, models of individual behaviors are composed with disease progression models to develop a realistic representation of the complex system in which individual behaviors and the social network adapt to the contagion. These methods are embodied within Simdemics – a general purpose modeling environment to support pandemic planning and response. Simdemics is designed specifically to be scalable to networks with 300 million agents – the underlying algorithms and methods in Simdemics are all high-performance computing oriented methods. New advances in network science, machine learning, high performance computing, data mining and behavioral modeling were necessary to develop Simdemics. Simdemics is combined with two other environments, Simfrastructure and Didactic, to form an integrated cyberenvironment. The integrated cyber-environment provides the end-user flexible and seamless Internet based access to Simdemics. Service-oriented architectures play a critical role in delivering the desired services to the end user. Simdemics, in conjunction with the integrated cyber-environment, has been used in over a dozen user defined case studies. These case studies were done to support specific policy questions that arose in the context of planning the response to pandemics (e.g., H1N1, H5N1) and human initiated bio-terrorism events. These studies played a crucial role in the continual development and improvement of the cyber-environment.