Goto

Collaborating Authors

 positive item



ITEM: Improving Training and Evaluation of Message-Passing based GNNs for top-k recommendation

Karmim, Yannis, Ramzi, Elias, Fournier-S'niehotta, Raphaël, Thome, Nicolas

arXiv.org Artificial Intelligence

Graph Neural Networks (GNNs), especially message-passing-based models, have become prominent in top-k recommendation tasks, outperforming matrix factorization models due to their ability to efficiently aggregate information from a broader context. Although GNNs are evaluated with ranking-based metrics, e.g. NDCG@k and Recall@k, they remain largely trained with proxy losses, e.g. the BPR loss. In this work we explore the use of ranking loss functions to directly optimize the evaluation metrics, an area not extensively investigated in the GNN community for collaborative filtering. We take advantage of smooth approximations of the rank to facilitate end-to-end training of GNNs and propose a Personalized PageRank-based negative sampling strategy tailored for ranking loss functions. Moreover, we extend the evaluation of GNN models for top-k recommendation tasks with an inductive user-centric protocol, providing a more accurate reflection of real-world applications. Our proposed method significantly outperforms the standard BPR loss and more advanced losses across four datasets and four recent GNN architectures while also exhibiting faster training.


Revisiting Recommendation Loss Functions through Contrastive Learning (Technical Report)

Li, Dong, Jin, Ruoming, Ren, Bin

arXiv.org Artificial Intelligence

Inspired by the success of contrastive learning, we systematically examine recommendation losses, including listwise (softmax), pairwise (BPR), and pointwise (MSE and CCL) losses. In this endeavor, we introduce InfoNCE+, an optimized generalization of InfoNCE with balance coefficients, and highlight its performance advantages, particularly when aligned with our new decoupled contrastive loss, MINE+. We also leverage debiased InfoNCE to debias pointwise recommendation loss (CCL) as Debiased CCL. Interestingly, our analysis reveals that linear models like iALS and EASE are inherently debiased. Empirical results demonstrates the effectiveness of MINE+ and Debiased-CCL.


Dimension Independent Mixup for Hard Negative Sample in Collaborative Filtering

Wu, Xi, Yang, Liangwei, Gong, Jibing, Zhou, Chao, Lin, Tianyu, Liu, Xiaolong, Yu, Philip S.

arXiv.org Artificial Intelligence

To address this In the contemporary era of voluminous data [17], individuals are limitation, we propose Dimension Independent Mixup for Hard inundated with an incessant influx of content generated by the internet. Negative Sampling (DINS), which is the first Area-wise sampling To address the issue of information overload, Recommender method for training CF-based models. DINS comprises three modules: Systems (RecSys) are employed to assist users in locating the most Hard Boundary Definition, Dimension Independent Mixup, relevant information and are increasingly pivotal in online services and Multi-hop Pooling. Experiments with real-world datasets on such as news feed [30], music suggestion [5], and online shopping both matrix factorization and graph-based models demonstrate [9]. Collaborative filtering (CF) [13], a highly effective method that DINS outperforms other negative sampling methods, establishing that predicts a user's preference based on their past interactions, is its effectiveness and superiority. Our work contributes a new widely employed. The latest CF-based models [10, 28] incorporate perspective, introduces Area-wise sampling, and presents DINS historical interactions into condensed user/item vectors and predict as a novel approach that achieves state-of-the-art performance for a user's preference for each item based on the dot product of negative sampling.


Integrating Item Relevance in Training Loss for Sequential Recommender Systems

Bacciu, Andrea, Siciliano, Federico, Tonellotto, Nicola, Silvestri, Fabrizio

arXiv.org Artificial Intelligence

Sequential Recommender Systems (SRSs) are a popular type of recommender system that learns from a user's history to predict the next item they are likely to interact with. However, user interactions can be affected by noise stemming from account sharing, inconsistent preferences, or accidental clicks. To address this issue, we (i) propose a new evaluation protocol that takes multiple future items into account and (ii) introduce a novel relevance-aware loss function to train a SRS with multiple future items to make it more robust to noise. Our relevance-aware models obtain an improvement of ~1.2% of NDCG@10 and 0.88% in the traditional evaluation protocol, while in the new evaluation protocol, the improvement is ~1.63% of NDCG@10 and ~1.5% of HR w.r.t the best performing models.


Adapting Triplet Importance of Implicit Feedback for Personalized Recommendation

Wu, Haolun, Ma, Chen, Zhang, Yingxue, Liu, Xue, Tang, Ruiming, Coates, Mark

arXiv.org Artificial Intelligence

Implicit feedback is frequently used for developing personalized recommendation services due to its ubiquity and accessibility in real-world systems. In order to effectively utilize such information, most research adopts the pairwise ranking method on constructed training triplets (user, positive item, negative item) and aims to distinguish between positive items and negative items for each user. However, most of these methods treat all the training triplets equally, which ignores the subtle difference between different positive or negative items. On the other hand, even though some other works make use of the auxiliary information (e.g., dwell time) of user behaviors to capture this subtle difference, such auxiliary information is hard to obtain. To mitigate the aforementioned problems, we propose a novel training framework named Triplet Importance Learning (TIL), which adaptively learns the importance score of training triplets. We devise two strategies for the importance score generation and formulate the whole procedure as a bilevel optimization, which does not require any rule-based design. We integrate the proposed training procedure with several Matrix Factorization (MF)- and Graph Neural Network (GNN)-based recommendation models, demonstrating the compatibility of our framework. Via a comparison using three real-world datasets with many state-of-the-art methods, we show that our proposed method outperforms the best existing models by 3-21\% in terms of Recall@k for the top-k recommendation.


Multi-Objective Personalized Product Retrieval in Taobao Search

Zheng, Yukun, Bian, Jiang, Meng, Guanghao, Zhang, Chao, Wang, Honggang, Zhang, Zhixuan, Li, Sen, Zhuang, Tao, Liu, Qingwen, Zeng, Xiaoyi

arXiv.org Artificial Intelligence

In large-scale e-commerce platforms like Taobao, it is a big challenge to retrieve products that satisfy users from billions of candidates. This has been a common concern of academia and industry. Recently, plenty of works in this domain have achieved significant improvements by enhancing embedding-based retrieval (EBR) methods, including the Multi-Grained Deep Semantic Product Retrieval (MGDSPR) model [16] in Taobao search engine. However, we find that MGDSPR still has problems of poor relevance and weak personalization compared to other retrieval methods in our online system, such as lexical matching and collaborative filtering. These problems promote us to further strengthen the capabilities of our EBR model in both relevance estimation and personalized retrieval. In this paper, we propose a novel Multi-Objective Personalized Product Retrieval (MOPPR) model with four hierarchical optimization objectives: relevance, exposure, click and purchase. We construct entire-space multi-positive samples to train MOPPR, rather than the single-positive samples for existing EBR models.We adopt a modified softmax loss for optimizing multiple objectives. Results of extensive offline and online experiments show that MOPPR outperforms the baseline MGDSPR on evaluation metrics of relevance estimation and personalized retrieval. MOPPR achieves 0.96% transaction and 1.29% GMV improvements in a 28-day online A/B test. Since the Double-11 shopping festival of 2021, MOPPR has been fully deployed in mobile Taobao search, replacing the previous MGDSPR. Finally, we discuss several advanced topics of our deeper explorations on multi-objective retrieval and ranking to contribute to the community.


Unbiased Estimations based on Binary Classifiers: A Maximum Likelihood Approach

Puts, Marco J. H., Daas, Piet J. H.

arXiv.org Machine Learning

Binary classifiers trained on a certain proportion of positive items introduce a bias when applied to data sets with different proportions of positive items. Most solutions for dealing with this issue assume that some information on the latter distribution is known. However, this is not always the case, certainly when this proportion is the target variable. In this paper a maximum likelihood estimator for the true proportion of positives in data sets is suggested and tested on synthetic and real world data.


Vision-based Price Suggestion for Online Second-hand Items

Han, Liang, Yin, Zhaozheng, Xia, Zhurong, Guo, Li, Tang, Mingqian, Jin, Rong

arXiv.org Artificial Intelligence

Different from shopping in physical stores, where people have the opportunity to closely check a product (e.g., touching the surface of a T-shirt or smelling the scent of perfume) before making a purchase decision, online shoppers rely greatly on the uploaded product images to make any purchase decision. The decision-making is challenging when selling or purchasing second-hand items online since estimating the items' prices is not trivial. In this work, we present a vision-based price suggestion system for the online second-hand item shopping platform. The goal of vision-based price suggestion is to help sellers set effective prices for their second-hand listings with the images uploaded to the online platforms. First, we propose to better extract representative visual features from the images with the aid of some other image-based item information (e.g., category, brand). Then, we design a vision-based price suggestion module which takes the extracted visual features along with some statistical item features from the shopping platform as the inputs to determine whether an uploaded item image is qualified for price suggestion by a binary classification model, and provide price suggestions for items with qualified images by a regression model. According to two demands from the platform, two different objective functions are proposed to jointly optimize the classification model and the regression model. For better model training, we also propose a warm-up training strategy for the joint optimization. Extensive experiments on a large real-world dataset demonstrate the effectiveness of our vision-based price prediction system.


Price Suggestion for Online Second-hand Items with Texts and Images

Han, Liang, Yin, Zhaozheng, Xia, Zhurong, Tang, Mingqian, Jin, Rong

arXiv.org Artificial Intelligence

This paper presents an intelligent price suggestion system for online second-hand listings based on their uploaded images and text descriptions. The goal of price prediction is to help sellers set effective and reasonable prices for their second-hand items with the images and text descriptions uploaded to the online platforms. Specifically, we design a multi-modal price suggestion system which takes as input the extracted visual and textual features along with some statistical item features collected from the second-hand item shopping platform to determine whether the image and text of an uploaded second-hand item are qualified for reasonable price suggestion with a binary classification model, and provide price suggestions for second-hand items with qualified images and text descriptions with a regression model. To satisfy different demands, two different constraints are added into the joint training of the classification model and the regression model. Moreover, a customized loss function is designed for optimizing the regression model to provide price suggestions for second-hand items, which can not only maximize the gain of the sellers but also facilitate the online transaction. We also derive a set of metrics to better evaluate the proposed price suggestion system. Extensive experiments on a large real-world dataset demonstrate the effectiveness of the proposed multi-modal price suggestion system.