CSRM-LLM: Embracing Multilingual LLMs for Cold-Start Relevance Matching in Emerging E-commerce Markets
Wang, Yujing, Chen, Yiren, Li, Huoran, Xu, Chunxu, Luo, Yuchong, Mao, Xianghui, Li, Cong, Du, Lun, Ma, Chunyang, Jiang, Qiqi, Wang, Yin, Gao, Fan, Mo, Wenting, Wen, Pei, Kumar, Shantanu, Park, Taejin, Song, Yiwei, Rajaram, Vijay, Cheng, Tao, Durgia, Sonu, Kolari, Pranam
–arXiv.org Artificial Intelligence
As global e-commerce platforms continue to expand, companies are entering new markets where they encounter cold-start challenges due to limited human labels and user behaviors. In this paper, we share our experiences in Coupang to provide a competitive cold-start performance of relevance matching for emerging e-commerce markets. Specifically, we present a Cold-Start Relevance Matching (CSRM) framework, utilizing a multilingual Large Language Model (LLM) to address three challenges: (1) activating cross-lingual transfer learning abilities of LLMs through machine translation tasks; (2) enhancing query understanding and incorporating e-commerce knowledge by retrieval-based query augmentation; (3) mitigating the impact of training label errors through a multi-round self-distillation training strategy. Our experiments demonstrate the effectiveness of CSRM-LLM and the proposed techniques, resulting in successful real-world deployment and significant online gains, with a 45.8% reduction in defect ratio and a 0.866% uplift in session purchase rate.
arXiv.org Artificial Intelligence
Sep-19-2025
- Country:
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Services > e-Commerce Services (1.00)
- Technology: