Not enough data to create a plot.
Try a different view from the menu above.
Institute for Infocomm Research
Active Transfer Learning for Cross-System Recommendation
Zhao, Lili (The Hong Kong University of Science and Technology) | Pan, Sinno Jialin (Institute for Infocomm Research) | Xiang, Evan Wei (Baidu Inc.) | Zhong, Erheng (The Hong Kong University of Science and Technology) | Lu, Zhongqi (The Hong Kong University of Science and Technology) | Yang, Qiang (Huawei Noah’s Ark Lab)
Recommender systems, especially the newly launched ones, have to deal with the data-sparsity issue, where little existing rating information is available. Recently, transfer learning has been proposed to address this problem by leveraging the knowledge from related recommender systems where rich collaborative data are available. However, most previous transfer learning models assume that entity-correspondences across different systems are given as input, which means that for any entity (e.g., a user or an item) in a target system, its corresponding entity in a source system is known. This assumption can hardly be satisfied in real-world scenarios where entity-correspondences across systems are usually unknown, and the cost of identifying them can be expensive. For example, it is extremely difficult to identify whether a user A from Facebook and a user B from Twitter are the same person. In this paper, we propose a framework to construct entity correspondence with limited budget by using active learning to facilitate knowledge transfer across recommender systems. Specifically, for the purpose of maximizing knowledge transfer, we first iteratively select entities in the target system based on our proposed criterion to query their correspondences in the source system. We then plug the actively constructed entity-correspondence mapping into a general transferred collaborative-filtering model to improve recommendation quality. We perform extensive experiments on real world datasets to verify the effectiveness of our proposed framework for this cross-system recommendation problem.
From Semantic to Emotional Space in Probabilistic Sense Sentiment Analysis
Mohtarami, Mitra (National University of Singapore) | Lan, Man (Institute for Infocomm Research) | Tan, Chew Lim (National University of Singapore)
This paper proposes an effective approach to model the emotional space of words to infer their Sense Sentiment Similarity (SSS). SSS reflects the distance between the words regarding their senses and underlying sentiments. We propose a probabilistic approach that is built on a hidden emotional model in which the basic human emotions are considered as hidden. This leads to predict a vector of emotions for each sense of the words, and then to infer the sense sentiment similarity. The effectiveness of the proposed approach is investigated in two Natural Language Processing tasks: Indirect yes/no Question Answer Pairs Inference and Sentiment Orientation Prediction.
Sense Sentiment Similarity: An Analysis
Mohtarami, Mitra (National University of Singapore) | Amiri, Hadi (National University of Singapore) | Lan, Man (Institute for Infocomm Research) | Tran, Thanh Phu (National University of Singapore) | Tan, Chew Lim (National University of Singapore)
This paper describes an emotion-based approach to acquire sentiment similarity of word pairs with respect to their senses. Sentiment similarity indicates the similarity between two words from their underlying sentiments. Our approach is built on a model which maps from senses of words to vectors of twelve basic emotions. The emotional vectors are used to measure the sentiment similarity of word pairs. We show the utility of measuring sentiment similarity in two main natural language processing tasks, namely, indirect yes/no question answer pairs (IQAP) Inference and sentiment orientation (SO) prediction. Extensive experiments demonstrate that our approach can effectively capture the sentiment similarity of word pairs and utilize this information to address the above mentioned tasks.
Heterogeneous Transfer Learning for Image Classification
Zhu, Yin (Hong Kong University of Science and Technology) | Chen, Yuqiang (Shanghai Jiao Tong University) | Lu, Zhongqi (†Hong Kong University of Science and Technology) | Pan, Sinno Jialin (Institute for Infocomm Research) | Xue, Gui-Rong (Shanghai Jiao Tong University) | Yu, Yong (Shanghai Jiao Tong University) | Yang, Qiang (Hong Kong University of Science and Technology)
Transfer learning as a new machine learning paradigm has gained increasing attention lately. In situations where the training data in a target domain are not sufficient to learn predictive models effectively, transfer learning leverages auxiliary source data from other related source domains for learning. While most of the existing works in this area only focused on using the source data with the same structure as the target data, in this paper, we push this boundary further by proposing a heterogeneous transfer learning framework for knowledge transfer between text and images. We observe that for a target-domain classification problem, some annotated images can be found on many social Web sites, which can serve as a bridge to transfer knowledge from the abundant text documents available over the Web. A key question is how to effectively transfer the knowledge in the source data even though the text can be arbitrarily found. Our solution is to enrich the representation of the target images with semantic concepts extracted from the auxiliary source data through a novel matrix factorization method. By using the latent semantic features generated by the auxiliary data, we are able to build a better integrated image classifier. We empirically demonstrate the effectiveness of our algorithm on the Caltech-256 image dataset.
Tree Sequence Kernel for Natural Language
Sun, Jun (National University of Singapore) | Zhang, Min (Institute for Infocomm Research) | Tan, Chew Lim (National University of Singapore)
We propose Tree Sequence Kernel (TSK), which implicitly exhausts the structure features of a sequence of subtrees embedded in the phrasal parse tree. By incorporating the capability of sequence kernel, TSK enriches tree kernel with tree sequence features so that it may provide additional useful patterns for machine learning applications. Two approaches of penalizing the substructures are proposed and both can be accomplished by efficient algorithms via dynamic programming. Evaluations are performed on two natural language tasks, i.e. Question Classification and Relation Extraction. Experimental results suggest that TSK outperforms tree kernel for both tasks, which also reveals that the structure features made up of multiple subtrees are effective and play a complementary role to the single tree structure.
Integrating Community Question and Answer Archives
Wei, Wei (Huazhong University of Science and Technology) | Cong, Gao (Nanyang Technological University) | Li, Xiaoli (Institute for Infocomm Research) | Ng, See-Kiong (Institute for Infocomm Research) | Li, Guohui (Huazhong University of Science and Technology)
Question and answer pairs in Community Question Answering (CQA) services are organized into hierarchical structures or taxonomies to facilitate users to find the answers for their questions conveniently. We observe that different CQA services have their own knowledge focus and used different taxonomies to organize their question and answer pairs in their archives. As there are no simple semantic mappings between the taxonomies of the CQA services, the integration of CQA services is a challenging task. The existing approaches on integrating taxonomies ignore the hierarchical structures of the source taxonomy. In this paper, we propose a novel approach that is capable of incorporating the parent-child and sibling information in the hierarchical structures of the source taxonomy for accurate taxonomy integration. Our experimental results with real world CQA data demonstrate that the proposed method significantly outperforms state-of-the-art methods.
CCRank: Parallel Learning to Rank with Cooperative Coevolution
Wang, Shuaiqiang (Shandong University of Finance) | Gao, Byron J. (Texas State University-San Marcos) | Wang, Ke (Simon Fraser University) | Lauw, Hady W. (Institute for Infocomm Research)
We propose CCRank, the first parallel algorithm for learning to rank, targeting simultaneous improvement in learning accuracy and efficiency. CCRank is based on cooperative coevolution (CC), a divide-and-conquer framework that has demonstrated high promise in function optimization for problems with large search space and complex structures. Moreover, CC naturally allows parallelization of sub-solutions to the decomposed subproblems, which can substantially boost learning efficiency. With CCRank, we investigate parallel CC in the context of learning to rank. Extensive experiments on benchmarks in comparison with the state-of-the-art algorithms show that CCRank gains in both accuracy and efficiency.
Source-Selection-Free Transfer Learning
Xiang, Evan Wei (The Hong Kong University of Science and Technology) | Pan, Sinno Jialin (Institute for Infocomm Research) | Pan, Weike (The Hong Kong University of Science and Technology) | Su, Jian (Institute for Infocomm Research) | Yang, Qiang (The Hong Kong University of Science and Technology)
Transfer learning addresses the problems that labeled training data are insufficient to produce a high-performance model. Typically, given a target learning task, most transfer learning approaches require to select one or more auxiliary tasks as sources by the designers. However, how to select the right source data to enable effective knowledge transfer automatically is still an unsolved problem, which limits the applicability of transfer learning. In this paper, we take one step ahead and propose a novel transfer learning framework, known as source-selection-free transfer learning (SSFTL), to free users from the need to select source domains. Instead of asking the users for source and target data pairs, as traditional transfer learning does, SSFTL turns to some online information sources such as World Wide Web or the Wikipedia for help. The source data for transfer learning can be hidden somewhere within this large online information source, but the users do not know where they are. Based on the online information sources, we train a large number of classifiers. Then, given a target task, a bridge is built for labels of the potential source candidates and the target domain data in SSFTL via some large online social media with tag cloud as a label translator. An added advantage of SSFTL is that, unlike many previous transfer learning approaches, which are difficult to scale up to the Web scale, SSFTL is highly scalable and can offset much of the training work to offline stage. We demonstrate the effectiveness and efficiency of SSFTL through extensive experiments on several real-world datasets in text classification.
Possibilistic Behavior Recognition in Smart Homes for Cognitive Assistance
Roy, Patrice C. (Domus Lab, Universite de Sherbrooke) | Giroux, Sylvain (Domus Lab, Université) | Bouchard, Bruno (de Sherbrooke) | Bouzouane, Abdenour (LIARA Lab, Université) | Phua, Clifton (du Québec à) | Tolstikov, Andrei (Chicoutimi) | Biswas, Jit (LIARA Lab, Université)
Providing cognitive assistance in smart homes is a field of research that receives a lot of attention lately. In order to give adequate assistance at the opportune moment, we need to recognize the observed behavior when the patient carries out some activities in a smart home. To address this challenging issue, we present a formal activity recognition framework based on possibility theory. We present initial results from an implementation of this possibilistic recognition approach in a smart home laboratory.