Pre-trained Recommender Systems: A Causal Debiasing Perspective

Lin, Ziqian, Ding, Hao, Hoang, Nghia Trong, Kveton, Branislav, Deoras, Anoop, Wang, Hao

Jan-8-2024–arXiv.org Artificial Intelligence

Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired by such progress, we investigate in this paper the possibilities and challenges of adapting such a paradigm to the context of recommender systems, which is less investigated from the perspective of pre-trained model. In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data). However, unlike vision/language data which share strong conformity in the semantic space, universal patterns underlying recommendation data collected across different domains (e.g., different countries or different E-commerce platforms) are often occluded by both in-domain and cross-domain biases implicitly imposed by the cultural differences in their user and item bases, as well as their uses of different e-commerce platforms. As shown in our experiments, such heterogeneous biases in the data tend to hinder the effectiveness of the pre-trained model. To address this challenge, we further introduce and formalize a causal debiasing perspective, which is substantiated via a hierarchical Bayesian deep learning model, named PreRec. Our empirical studies on real-world data show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings under both cross-market and cross-platform scenarios.

prerec, recommendation, target domain, (14 more...)

arXiv.org Artificial Intelligence

Jan-8-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.05)
- North America
  - United States
    - Washington (0.04)
    - Wisconsin > Dane County
      - Madison (0.04)
    - Texas > Harris County
      - Houston (0.04)
    - New York > New York County
      - New York City (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Mexico > Yucatán
    - Mérida (0.05)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Germany (0.05)
  - Spain (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- Asia
  - Japan (0.05)
  - India (0.04)

Genre:
- Research Report
  - New Finding (0.34)
  - Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Personal Assistant Systems (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)