Kitsuregawa, Masaru
Recommending Related Microblogs: A Comparison Between Topic and WordNet based Approaches
Chen, Xing (Wuhan University of Technology) | Li, Lin (Wuhan University of Technology) | Xu, Guandong (Victoria University) | Yang, Zhenglu (The University of Tokyo) | Kitsuregawa, Masaru (The University of Tokyo)
Computing similarity between short microblogs is an important step in microblog recommendation. In this paper, we investigate a topic based approach and a WordNet based approach to estimate similarity scores between microblogs and recommend top related ones to users. Empirical study is conducted to compare their recommendation effectiveness using two evaluation measures. The results show that the WordNet based approach has relatively higher precision than that of the topic based approach using 548 tweets as dataset. In addition, the Kendall tau distance between two lists recommended by WordNet and topic approaches is calculated. Its average of all the 548 pair lists tells us the two approaches have the relative high disaccord in the ranking of related tweets.
SemRec: A Semantic Enhancement Framework for Tag Based Recommendation
Xu, Guandong (Victoria University) | Gu, Yanhui (University of Tokyo) | Dolog, Peter (Aalborg University) | Zhang, Yanchun (Victoria University) | Kitsuregawa, Masaru (University of Tokyo)
Collaborative tagging services provided by various social web sites become popular means to mark web resources for different purposes such as categorization, expression of a preference and so on. However, the tags are of syntactic nature, in a free style and do not reflect semantics, resulting in the problems of redundancy, ambiguity and less semantics. Current tag-based recommender systems mainly take the explicit structural information among users, resources and tags into consideration, while neglecting the important implicit semantic relationships hidden in tagging data. In this study, we propose a Semantic Enhancement Recommendation strategy (SemRec), based on both structural information and semantic information through a unified fusion model. Extensive experiments conducted on two real datasets demonstarte the effectiveness of our approaches.
Fast Algorithms for Top-k Approximate String Matching
Yang, Zhenglu (The University of Tokyo) | Yu, Jianjun (Chinese Academy of Sciences) | Kitsuregawa, Masaru (The University of Tokyo)
Top- k approximate querying on string collections is an important data analysis tool for many applications, and it has been exhaustively studied. However, the scale of the problem has increased dramatically because of the prevalence of the Web. In this paper, we aim to explore the efficient top- k similar string matching problem. Several efficient strategies are introduced, such as length aware and adaptive q -gram selection. We present a general q -gram based framework and propose two efficient algorithms based on the strategies introduced. Our techniques are experimentally evaluated on three real data sets and show a superior performance.