A Chat About Boring Problems: Studying GPT-based text normalization
Zhang, Yang, Bartley, Travis M., Graterol-Fuenmayor, Mariana, Lavrukhin, Vitaly, Bakhturina, Evelina, Ginsburg, Boris
arXiv.org Artificial Intelligence
Text normalization, the conversion of text from written to spoken form, is traditionally assumed to be an ill-formed task for language models. In this work, we argue otherwise. We empirically show the capacity of large language models (LLMs) for text normalization in few-shot scenarios. Combining self-consistency reasoning with linguistically informed prompt engineering, we find that LLM-based text normalization achieves error rates around 40% lower than top normalization systems. Further, upon error analysis, we note key limitations in the conventional design of text normalization tasks. We create a new taxonomy of text normalization errors and apply it to results from GPT-3.5-Turbo and GPT-4.0. Through this new framework, we can identify strengths and weaknesses of GPT-based text normalization (TN), opening opportunities for future work.
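To make the self-consistency idea concrete, here is a minimal sketch that treats it as majority voting over several sampled normalizations of the same input. The abstract does not describe the paper's actual prompts, sampling setup, or voting rule, so everything here is an assumption: the function name `self_consistent_normalization` and the sample strings are hypothetical, and the samples stand in for outputs that would come from repeated few-shot LLM calls.

```python
from collections import Counter


def self_consistent_normalization(candidates):
    """Return the most frequent candidate and its share of the votes.

    `candidates` is a list of spoken-form strings, assumed to be sampled
    from repeated LLM calls on the same written input (hypothetical setup).
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    # Collapse case and whitespace so trivially different outputs vote together.
    cleaned = [" ".join(c.lower().split()) for c in candidates]
    winner, votes = Counter(cleaned).most_common(1)[0]
    return winner, votes / len(cleaned)


# Hypothetical samples for the written form "$5.50".
samples = [
    "five dollars fifty cents",
    "five dollars and fifty cents",
    "Five dollars  fifty cents",
    "five point five zero dollars",
    "five dollars fifty cents",
]
best, share = self_consistent_normalization(samples)
print(best, share)
```

The vote share can serve as a rough agreement signal: a low share suggests the samples disagree and the input may deserve a closer look.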
Jan-17-2024
- Country:
  - Asia > South Korea (0.04)
  - North America > United States
    - New York (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
  - Europe
    - France (0.05)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
- Genre:
  - Research Report (0.40)
- Technology: