ChoirRec: Semantic User Grouping via LLMs for Conversion Rate Prediction of Low-Activity Users
Zhai, Dakai, Gao, Jiong, Du, Boya, Xu, Junwei, Shen, Qijie, Zhu, Jialin, Jiang, Yuning
–arXiv.org Artificial Intelligence
Accurately predicting conversion rates (CVR) for low-activity users remains a fundamental challenge in large-scale e-commerce recommender systems. Existing approaches face three critical limitations: (i) reliance on noisy and unreliable behavioral signals; (ii) insufficient user-level information due to the lack of diverse interaction data; and (iii) a systemic training bias toward high-activity users that overshadows the needs of low-activity users. To address these challenges, we propose ChoirRec, a novel framework that leverages the semantic capabilities of Large Language Models (LLMs) to construct semantic user groups and enhance CVR prediction for low-activity users. With a dual-channel architecture designed for robust cross-user knowledge transfer, ChoirRec comprises three components: (i) a Semantic Group Generation module that utilizes LLMs to form reliable, cross-activity user clusters, thereby filtering out noisy signals; (ii) a Group-aware Hierarchical Representation module that enriches sparse user embeddings with informative group-level priors to mitigate data insufficiency; and (iii) a Group-aware Multi-granularity Modual that employs a dual-channel architecture and adaptive fusion mechanism to ensure effective learning and utilization of group knowledge. We conduct extensive offline and online experiments on Taobao, a leading industrial-scale e-commerce platform. ChoirRec improves GAUC by 1.16\% in offline evaluations, while online A/B testing reveals a 7.24\% increase in order volume, highlighting its substantial practical value in real-world applications.
arXiv.org Artificial Intelligence
Oct-14-2025
- Country:
- Asia
- China
- Guangdong Province > Shenzhen (0.04)
- Zhejiang Province > Hangzhou (0.05)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- China
- Europe
- France > Île-de-France
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Spain > Galicia
- Madrid (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Mexico > Yucatán
- Mérida (0.04)
- United States
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- New York > New York County
- New York City (0.05)
- Texas > Travis County
- Austin (0.04)
- Alaska > Anchorage Municipality
- Canada
- Asia
- Genre:
- Research Report (0.65)
- Industry:
- Information Technology (0.68)
- Technology: