Personal Assistant Systems
Recommendation Algorithms for Optimizing Hit Rate, User Satisfaction and Website Revenue
Wang, Xin (Zhejiang University) | Guo, Yunhui (Zhejiang University) | Xu, Congfu (Zhejiang University)
We generally use hit rate to measure the performance of item recommendation algorithms. In addition to hit rate, we consider another two important factors which are ignored by most previous works. First, whether users are satisfied with the recommended items. It is possible that a user has bought an item but dislikes it. Hence high hit rate does not reflect high customer satisfaction. Second, whether the website retailers are satisfied with the recommendation results. If a customer is interested in two products and wants to buy one of them, it may be better to suggest the item which can help bring more profit. Therefore, a good recommendation algorithm should not only consider improving hit rate but also consider optimizing user satisfaction and website revenue. In this paper, we propose two algorithms for the above purposes and design two modified hit rate based metrics to measure them. Experimental results on 10 real-world datasets show that our methods can not only achieve better hit rate, but improve user satisfaction and website revenue comparing with the state-of-the-art models.
A Boosting Algorithm for Item Recommendation with Implicit Feedback
Liu, Yong (Nanyang Technological University) | Zhao, Peilin (A*STAR) | Sun, Aixin (Nanyang Technological University) | Miao, Chunyan (Nanyang Technological University)
Many recommendation tasks are formulated as top- N item recommendation problems based on users' implicit feedback instead of explicit feedback. Here explicit feedback refers to users' ratings to items while implicit feedback is derived from users' interactions with items, e.g. , number of times a user plays a song. In this paper, we propose a boosting algorithm named AdaBPR ( Ada ptive B oosting P ersonalized R anking) for top- N item recommendation using users' implicit feedback. In the proposed framework, multiple homogeneous component recommenders are linearly combined to create an ensemble model, for better recommendation accuracy. The component recommenders are constructed based on a fixed collaborative filtering algorithm by using a re-weighting strategy, which assigns a dynamic weight distribution on the observed user-item interactions. AdaBPR demonstrates its effectiveness on three datasets compared with strong baseline algorithms.
Modeling Users' Dynamic Preference for Personalized Recommendation
Liu, Xin (Institute for Infocomm Research)
Modeling the evolution of users' preference over time is essential for personalized recommendation. Traditional time-aware models like (1) time-window or recency based approaches ignore or deemphasize much potentially useful information, and (2) time-aware collaborative filtering (CF) approaches largely rely on the information of other users, thus failing to precisely and comprehensively profile individual users for personalization. In this paper, for implicit feedback data, we propose a personalized recommendation model to capture users' dynamic preference using Gaussian process. We first apply topic modeling to represent a user's temporal preference in an interaction as a topic distribution. By aggregating such topic distributions of the user's past interactions, we build her profile, where we treat each topic's values at different interactions as a time series. Gaussian process is then applied to predict the user's preference in the next interactions for top-N recommendation. Experiments conducted over two real datasets demonstrate that our approach outperforms the state-of-the-art recommendation models by at least 42.46% and 66.14% in terms of precision and Mean Reciprocal Rank respectively.
Sparse Probabilistic Matrix Factorization by Laplace Distribution for Collaborative Filtering
Jing, Liping (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University) | Wang, Peng (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University) | Yang, Liu (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University)
In recommendation systems, probabilistic matrix factorization (PMF) is a state-of-the-art collaborative filtering method by determining the latent features to represent users and items. However, two major issues limiting the usefulness of PMF are the sparsity problem and long-tail distribution. Sparsity refers to the situation that the observed rating data are sparse, which results in that only part of latent features are informative for describing each item/user. Long tail distribution implies that a large fraction of items have few ratings. In this work, we propose a sparse probabilistic matrix factorization method (SPMF) by utilizing a Laplacian distribution to model the item/user factor vector. Laplacian distribution has ability to generate sparse coding, which is beneficial for SPMF to distinguish the relevant and irrelevant latent features with respect to each item/user. Meanwhile, the tails in Laplacian distribution are comparatively heavy, which is rewarding for SPMF to recommend the tail items. Furthermore, a distributed Gibbs sampling algorithm is developed to efficiently train the proposed sparse probabilistic model. A series of experiments on Netfilix and Movielens datasets have been conducted to demonstrate that SPMF outperforms the existing PMF and its extended version Bayesian PMF (BPMF), especially for the recommendation of tail items.
Differentially Private Matrix Factorization
Hua, Jingyu (Nanjing University) | Xia, Chang (Nanjing University) | Zhong, Sheng (Nanjing University)
Matrix factorization (MF) is a prevailing collaborative filtering method for building recommender systems. It requires users to upload their personal preferences to the recommender for performing MF, which raises serious privacy concerns. This paper proposes a differentially private MF mechanism that can prevent an untrusted recommender from learning any users' ratings or profiles. Our design decouples computations upon users' private data from the recommender to users, and makes the recommender aggregate local results in a privacy-preserving way. It uses the objective perturbation to make sure that the final item profiles satisfy differential privacy and solves the challenge to decompose the noise component for objective perturbation into small pieces that can be determined locally and independently by users. We also propose a third-party based mechanism to reduce noises added in each iteration and adapt our online algorithm to the dynamic setting that allows users to leave and join. The experiments show that our proposal is efficient and introduces acceptable side effects on the precision of results.
A Synthetic Approach for Recommendation: Combining Ratings, Social Relations, and Reviews
Hu, Guang-Neng (Nanjing University) | Dai, Xin-Yu (Nanjing University) | Song, Yunya (Hong Kong Baptist University) | Huang, Shu-Jian (Nanjing University) | Chen, Jia-Jun (State Key Laboratory for Novel Software Technology, Nanjing University)
Recommender systems (RSs) provide an effective way of alleviating the information overload problem by selecting personalized choices. Online social networks and user-generated content provide diverse sources for recommendation beyond ratings, which present opportunities as well as challenges for traditional RSs. Although social matrix factorization (Social MF) can integrate ratings with social relations and topic matrix factorization can integrate ratings with item reviews, both of them ignore some useful information. In this paper, we investigate the effective data fusion by combining the two approaches, in two steps. First, we extend Social MF to exploit the graph structure of neighbors. Second, we propose a novel framework MR3 to jointly model these three types of information effectively for rating prediction by aligning latent factors and hidden topics. We achieve more accurate rating prediction on two real-life datasets. Furthermore, we measure the contribution of each data source to the proposed framework.
Music Recommenders: User Evaluation Without Real Users?
Craw, Susan (Robert Gordon University) | Horsburgh, Ben (Robert Gordon University) | Massie, Stewart (Robert Gordon University)
Good music recommenders should not only suggest quality recommendations, but should also allow users to discover new/niche music. User studies capture explicit feedback on recommendation quality and novelty, but can be expensive, and may have difficulty replicating realistic scenarios. Lack of effective offline evaluation methods restricts progress in music recommendation research. The challenge is finding suitable measures to score recommendation quality, and in particular avoiding popularity bias, whereby the quality is not recognised when the track is not well known. This paper presents a low cost method that leverages available social media data and shows it to be effective. Not only is it based on explicit feedback from many users, but it also overcomes the popularity bias that disadvantages new/niche music. Experiments show that its findings are consistent with those from an online study with real users. In comparisons with other offline measures, the social media score is shown to be a more reliable proxy for opinions of real users. Its impact on music recommendation is its ability to recognise recommenders that enable discovery, as well as suggest quality recommendations.
Optimal Greedy Diversity for Recommendation
Ashkan, Azin (Technicolor Research) | Kveton, Branislav (Adobe Research) | Berkovsky, Shlomo (CSIRO) | Wen, Zheng (Yahoo! Labs)
The need for diversification manifests in various recommendation use cases. In this work, we propose a novel approach to diversifying a list of recommended items, which maximizes the utility of the items subject to the increase in their diversity. From a technical perspective, the problem can be viewed as maximization of a modular function on the polytope of a submodular function, which can be solved optimally by a greedy method. We evaluate our approach in an offline analysis, which incorporates a number of baselines and metrics, and in two online user studies. In all the experiments, our method outperforms the baseline methods.
Personalizing Product Rankings Using Collaborative Filtering on Opinion-Derived Topic Profiles
Musat, Claudiu Cristian (Ecole Polytechnique Federale de Lausanne) | Faltings, Boi (Ecole Polytechnique Federale de Lausanne)
Product review sites such as TripAdvisor, Yelp or Amazon provide a single, non personalized ranking of products. The sparse review data makes personalizing recommendations difficult. Topic Profile Collaborative Filtering exploits review texts to identify user profiles as a basis for similarity. We show that careful use of the available data and separating users into classes can greatly improve the performance of such techniques. We significantly improve MAE, RMSE, and Kendall tau, compared to the previous best results. In addition, we show that personalization does not benefit all the users to the same extent. We propose switching between a personalized and a non personalized method based on the user opinion profile. We show that the user's opinionatedness is a good indicator of whether the personalization will work or not.
Dependent Indian Buffet Process-based Sparse Nonparametric Nonnegative Matrix Factorization
Xuan, Junyu, Lu, Jie, Zhang, Guangquan, Da Xu, Richard Yi, Luo, Xiangfeng
Nonnegative Matrix Factorization (NMF) aims to factorize a matrix into two optimized nonnegative matrices appropriate for the intended applications. The method has been widely used for unsupervised learning tasks, including recommender systems (rating matrix of users by items) and document clustering (weighting matrix of papers by keywords). However, traditional NMF methods typically assume the number of latent factors (i.e., dimensionality of the loading matrices) to be fixed. This assumption makes them inflexible for many applications. In this paper, we propose a nonparametric NMF framework to mitigate this issue by using dependent Indian Buffet Processes (dIBP). In a nutshell, we apply a correlation function for the generation of two stick weights associated with each pair of columns of loading matrices, while still maintaining their respective marginal distribution specified by IBP. As a consequence, the generation of two loading matrices will be column-wise (indirectly) correlated. Under this same framework, two classes of correlation function are proposed (1) using Bivariate beta distribution and (2) using Copula function. Both methods allow us to adopt our work for various applications by flexibly choosing an appropriate parameter settings. Compared with the other state-of-the art approaches in this area, such as using Gaussian Process (GP)-based dIBP, our work is seen to be much more flexible in terms of allowing the two corresponding binary matrix columns to have greater variations in their non-zero entries. Our experiments on the real-world and synthetic datasets show that three proposed models perform well on the document clustering task comparing standard NMF without predefining the dimension for the factor matrices, and the Bivariate beta distribution-based and Copula-based models have better flexibility than the GP-based model.