PSCon: Toward Conversational Product Search

Zou, Jie, Aliannejadi, Mohammad, Kanoulas, Evangelos, Han, Shuxi, Ma, Heli, Wang, Zheng, Yang, Yang, Shen, Heng Tao

arXiv.org Artificial Intelligence 

Conversational Product Search (CPS) is confined to simulated conversations due to the lack of real-world CPS datasets that reflect human-like language. Additionally, current conversational datasets are limited to support cross-market and multi-lingual usage. In this paper, we introduce a new CPS data collection protocol and present PSCon, a novel CPS dataset designed to assist product search via human-like conversations. The dataset is constructed using a coached human-to-human data collection protocol and supports two languages and dual markets. Also, the dataset enables thorough exploration of six subtasks of CPS: user intent detection, keyword extraction, system action prediction, question selection, item ranking, and response generation. Furthermore, we also offer an analysis of the dataset and propose a benchmark model on the proposed CPS dataset.