Searching, fast and slow, through product catalogs
Ubrangala, Dayananda, Sharma, Juhi, Rangappa, Sharath Kumar, R, Kiran, Kondapalli, Ravi Prasad, Boué, Laurent
–arXiv.org Artificial Intelligence
String matching algorithms in the presence of abbreviations, such as in Stock Keeping Unit (SKU) product catalogs, remains a relatively unexplored topic. In this paper, we present a unified architecture for SKU search that provides both a real-time suggestion system (based on a Trie data structure) as well as a lower latency search system (making use of character level TF-IDF in combination with language model vector embeddings) where users initiate the search process explicitly. We carry out ablation studies that justify designing a complex search system composed of multiple components to address the delicate trade-off between speed and accuracy. Using SKU search in the Dynamics CRM as an example, we show how our system vastly outperforms, in all aspects, the results provided by the default search engine. Finally, we show how SKU descriptions may be enhanced via generative text models (using gpt-3.5-turbo)
arXiv.org Artificial Intelligence
Jan-1-2024
- Country:
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Europe > Sweden
- Västerbotten County > Umeå (0.04)
- North America > United States
- Genre:
- Research Report (0.40)
- Technology: