AITopics

This paper presents a new approach to disambiguate company names in the Twitter social network. We have focused on making lighter the processing of comparing company profiles with tweets in order to obtain a competitive real-time system. With this aim, we only use the home page of each company as information source to create a unique profile. On the other hand, we compute the similarity of a tweet in connection to a profile by comparing the content of the tweet with the profile. Both steps do not use any other external information source and all the process is developed in an unsupervised way. We have tested our application with the test WePS-3 CLEF ORM corpus obtaining encouraging results.

machine learning, real time system, tweet, (18 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

Europe > Spain > Galicia > Madrid (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Industry: Information Technology > Services (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.95)
(3 more...)

Zhang, Xin (Graduate University of Chinese Academy of Sciences) | He, Ben (Graduate University of Chinese Academy of Sciences) | Luo, Tiejian (Graduate University of Chinese Academy of Sciences)

Transductive Learning for Real-Time Twitter Search

Recency is an important dimension of relevance for real-time Twitter search as users tend to be interested in fresh news and events. By incorporating various sources of evidence, the application of learning to rank (LTR) algorithms to real-time Twitter search has shown beneficial in finding not only relevant, but also recent tweets in response to given queries. However, the potential effectiveness brought by LTR may not have been fully exploited due to the lack of labeled data available for properly learning a ranking model, since human labels are expensive in real-world applications. To this end, this paper proposes a transductive algorithm that incrementally aggregate the labeled tweets through an iterative process. Experimental results on the standard Tweets11 dataset show that our approach is able to outperform strong baselines without the use of human labels.

information retrieval, machine learning, natural language, (15 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > United States > New York (0.04)
Asia > China > Beijing > Beijing (0.04)

Industry: Information Technology (0.47)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.31)

A Supervised Approach to Predict Company Acquisition with Factual and Topic Features Using Profiles and News Articles on TechCrunch

Merger and Acquisition (M&A) prediction has been an interesting and challenging research topic in the past a few decades. However, past work has only adopted numerical features in building models, and yet the valuable textual information from the great variety of social media sites has not been touched at all. To fully explore this information, we used the profiles and news articles for companies and people on TechCrunch, the leading and largest public database for the tech world, which anybody can edit. Specifically, we explored topic features via topic modeling techniques, as well as a set of other novel features of our design within a machine learning framework. We conducted experiments of the largest scale in the literature, and achieved a high true positive rate (TP) between 60% to 79.8% with a false positive rate (FP) mostly between 0% and 8.3% over company categories with a small number of missing attributes in the CrunchBase profiles.

artificial intelligence, category, machine learning, (12 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > South Dakota > Clay County > Vermillion (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Greece (0.04)

Industry:

Information Technology (0.69)
Banking & Finance (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Computational Predictors in Online Social Deliberations

Woolf, Beverly Park (University of Massachusetts-Amherst) | Murray, Thomas (University of Massachusetts-Amherst) | Xu, Xiaoxi (University of Massachusetts-Amherst) | Osterweil, Leon (University of Massachusetts-Amherst) | Clarke, Lori (University of Massachusetts-Amherst) | Wing, Leah (University of Massachusetts-Amherst) | Katsh, Ethan (University of Massachusetts-Amherst)

This research seeks to identify online participants' disposi tion and skills. A prototype dashboard and annotation scheme were developed to support facilitators and several computational predictors were identified that show statisti cally significant correlations with dialogue skills as ob served by human annotators.

artificial intelligence, correlation, natural language, (15 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > United States > New Jersey > Bergen County > Mahwah (0.05)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)

Industry: Law (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.51)

Tanev, Hristo (Joint Research Centre, European Commission) | Ehrmann, Maud (Joint Research Centre, European Commission) | Piskorski, Jakub (Frontex) | Zavarella, Vanni (Joint Research Centre, European Commission)

Enhancing Event Descriptions through Twitter Mining

We describe a simple IR approach for linking news about events, detected by an event extraction system, to messages from Twitter (tweets). In particular, we explore several methods for creating event-specific queries for Twitter and provide a quantitative and qualitative evaluation of the relevance and usefulness of the information obtained from the tweets. We showed that methods based on utilization of word co-occurrence clustering, domain-specific keywords and named entity recognition improve the performance with respect to a basic approach.

information retrieval, natural language, tweet, (19 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

Asia > Middle East > Yemen (0.06)
Europe > Italy (0.05)
North America > United States > Oklahoma (0.04)
(4 more...)

Industry:

Government > Military (0.49)
Information Technology > Services (0.49)
Media > News (0.47)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.58)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.50)

Filtering Noisy Web Data by Identifying and Leveraging Users' Contributions

Stoica, Alina Mihaela (EDF)

In this paper we present several methods for collecting Web textual contents and filtering noisy data. We show that knowing which user publishes which contents can contribute to detecting noise. We begin by collecting data from two forums and from Twitter. For the forums, we extract the meaningful information from each discussion (texts of question and answers, IDs of users, date). For the Twitter dataset, we first detect tweets with very similar texts, which helps avoiding redundancy in further analysis. Also, this leads us to clusters of tweets that can be used in the same way as the forum discussions: they can be modeled by bipartite graphs. The analysis of nodes of the resulting graphs shows that network structure and content type (noisy or relevant) are not independent, so network studying can help in filtering noise.

artificial intelligence, social media, tweet, (18 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Clamart (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Industry:

Energy > Power Industry > Utilities (1.00)
Information Technology (0.69)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Data Science (0.89)

Evaluating Real-Time Search over Tweets

McCullough, Dean (National Institute of Standards and Technology) | Lin, Jimmy (University of Maryland) | Macdonald, Craig (University of Glasgow) | Ounis, Iadh (University of Glasgow) | McCreadie, Richard (University of Glasgow)

Twitter offers a phenomenal platform for the social sharing of information. We describe new resources that have been created in the context of the Text Retrieval Conference (TREC) to support the academic study of Twitter as a real-time information source. We formalize an information seeking task — real-time search — and offer a methodology for measuring system effectiveness. At the TREC 2011 Microblog Track, 58 research groups participated in the first ever evaluation of this task. We present data from the effort to illustrate and support our methodology.

artificial intelligence, real time system, tweet, (18 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > Haiti (0.16)
North America > United States > Maryland (0.04)
Africa > Middle East > Egypt (0.04)

Industry: Information Technology > Services (0.67)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence (0.95)

Social Media Is NOT that Bad! The Lexical Quality of Social Media

Rello, Luz (Universitat Pompeu Fabra) | Baeza-Yates, Ricardo (Yahoo! Research)

There is a strong correlation between spelling errors and web text content quality. Using our lexical quality measure, based in a small corpus of spelling errors, we present an estimation of the lexical quality of the main Social Media sites. This paper presents an updated and complete analysis of the lexical quality of Social Media written in English and Spanish, including how lexical quality changes in time.

artificial intelligence, natural language, text processing, (14 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)
Asia > India > Telangana > Hyderabad (0.04)

Genre: Research Report (0.47)

Industry: Information Technology > Services (0.71)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.55)

Talk of the City: Our Tweets, Our Community Happiness

Quercia, Daniele (University of Cambridge) | Seaghdha, Diarmuid O (University of Cambridge) | Crowcroft, Jon (University of Cambridge)

The literature of urban sociology and that of psychology have separately established two relationships: the first has linked characteristics of a community to its residents’ well-being, the second has linked well-being of individuals to their use of words. No one has hitherto explored the potential transitive relationship - that between characteristics of a community and its residents' use of words. We test this relationship by performing three steps. We consider Twitter users in a variety of London census communities; extract the subject matter of their tweets using "topic models"; and study the relationship between topics and community socio-economic well-being. We find that certain topics are correlated (positively and negatively) with community deprivation. Users in more deprived community tweet about wedding parties, matters expressed in Spanish/Portuguese, and celebrity gossips. By contrast, those in less deprived communities tweet about vacations, professional use of social media, environmental issues, sports, and health issues. We finally show that monitoring the subject matter of tweets not only offers insights into community well-being, but it is also a reasonable way of predicting community deprivation scores.

artificial intelligence, natural language, social media, (14 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > New York (0.04)
(6 more...)

Genre: Research Report > New Finding (0.47)

Industry:

Health & Medicine (0.95)
Leisure & Entertainment > Social Events (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.34)

Finding Influential Authors in Brand-Page Communities

Purohit, Hemant (Wright State University) | Ajmera, Jitendra (IBM Research, New Delhi) | Joshi, Sachindra (IBM Research, New Delhi) | Verma, Ashish (IBM Research, New Delhi) | Sheth, Amit (Wright State University)

Enterprises are increasingly using social media forums to engage with their customer online- a phenomenon known as Social Customer Relation Management (Social CRM) . In this context, it is important for an enterprise to identify “influential authors” and engage with them on a priority basis. We present a study towards finding influential authors on Twitter forums where an implicit network based on user interactions is created and analyzed. Furthermore, author profile features and user interaction features are combined in a decision tree classification model for finding influential authors. A novel objective evaluation criterion is used for evaluating various features and modeling techniques. We compare our methods with other approaches that use either only the formal connections or only the author profile features and show a significant improvement in the classification accuracy over these baselines as well as over using Klout score.

artificial intelligence, machine learning, profile feature, (18 more...)

Sixth International AAAI Conference on Weblogs and Social Media

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > India > NCT > New Delhi (0.04)
Asia > India > NCT > Delhi (0.04)

Industry: Information Technology > Services (0.47)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)