Pre-trained Recommender Systems: A Causal Debiasing Perspective
Lin, Ziqian, Ding, Hao, Hoang, Nghia Trong, Kveton, Branislav, Deoras, Anoop, Wang, Hao
–arXiv.org Artificial Intelligence
Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired by such progress, we investigate in this paper the possibilities and challenges of adapting such a paradigm to the context of recommender systems, which is less investigated from the perspective of pre-trained model. In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data). However, unlike vision/language data which share strong conformity in the semantic space, universal patterns underlying recommendation data collected across different domains (e.g., different countries or different E-commerce platforms) are often occluded by both in-domain and cross-domain biases implicitly imposed by the cultural differences in their user and item bases, as well as their uses of different e-commerce platforms. As shown in our experiments, such heterogeneous biases in the data tend to hinder the effectiveness of the pre-trained model. To address this challenge, we further introduce and formalize a causal debiasing perspective, which is substantiated via a hierarchical Bayesian deep learning model, named PreRec. Our empirical studies on real-world data show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings under both cross-market and cross-platform scenarios.
arXiv.org Artificial Intelligence
Jan-8-2024
- Country:
- Oceania > Australia (0.05)
- North America
- United States
- Washington (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Texas > Harris County
- Houston (0.04)
- New York > New York County
- New York City (0.04)
- California > San Diego County
- San Diego (0.04)
- Mexico > Yucatán
- Mérida (0.05)
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- Germany (0.05)
- Spain (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Asia
- Genre:
- Research Report
- New Finding (0.34)
- Promising Solution (0.34)
- Research Report