Retrieval Augmented Spelling Correction for E-Commerce Applications
Xuan Guo, Rohit Patki, Dante Everaert, Christopher Potts
arXiv.org Artificial Intelligence
The rapid introduction of new brand names into everyday language poses a unique challenge for e-commerce spelling correction services, which must distinguish genuine misspellings from novel brand names that use unconventional spelling. We seek to address this challenge via Retrieval Augmented Generation (RAG). In this approach, product names are retrieved from a catalog and incorporated into the context used by a large language model (LLM) that has been fine-tuned to do contextual spelling correction. Through quantitative evaluation and qualitative error analyses, we find that the RAG framework improves spelling correction over a stand-alone LLM. We also demonstrate the value of additional fine-tuning of the LLM to incorporate retrieved context.
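The retrieve-then-correct flow described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the catalog entries are invented, the retriever is a simple character-similarity ranking from Python's standard library (the abstract does not say which retriever is used), and the prompt layout is hypothetical. The resulting prompt would be passed to a fine-tuned LLM, which is not shown here.

```python
from difflib import SequenceMatcher

# Hypothetical catalog; these product names are invented for illustration.
CATALOG = [
    "Kwik Dry towel",
    "Quick Dry towel set",
    "Flurb wireless speaker",
    "Snugl baby blanket",
]


def retrieve(query, catalog, k=2):
    """Return the k catalog names most similar to the query.

    Assumption: character-level similarity stands in for whatever
    retriever the paper actually uses.
    """
    scored = [
        (SequenceMatcher(None, query.lower(), name.lower()).ratio(), name)
        for name in catalog
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:k]]


def build_prompt(query, retrieved):
    """Place retrieved product names in the context ahead of the query.

    Assumption: this prompt layout is hypothetical, not the paper's format.
    """
    context = "\n".join(f"- {name}" for name in retrieved)
    return (
        "Catalog products:\n"
        f"{context}\n"
        f"Query: {query}\n"
        "Corrected query:"
    )


query = "kwik dri towel"
prompt = build_prompt(query, retrieve(query, CATALOG))
print(prompt)
```

With the catalog entry in context, a model has evidence that "kwik" may be an intentional brand spelling while "dri" is more likely a typo, which is the distinction the abstract says a stand-alone LLM struggles to make.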
Oct-15-2024
- Country:
  - Asia
    - Indonesia > Bali (0.04)
    - Middle East > Jordan (0.04)
    - Singapore (0.04)
  - Europe > Austria
    - Vienna (0.04)
  - North America
    - Canada
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.04)
      - Ontario > Toronto (0.04)
    - United States
      - California > Santa Clara County
        - Palo Alto (0.04)
      - Maryland > Baltimore (0.04)
      - New York > New York County
        - New York City (0.04)
- Genre:
  - Research Report (0.40)
- Industry:
  - Information Technology > Services > e-Commerce Services (0.62)
- Technology: