A Chat About Boring Problems: Studying GPT-based text normalization

Zhang, Yang, Bartley, Travis M., Graterol-Fuenmayor, Mariana, Lavrukhin, Vitaly, Bakhturina, Evelina, Ginsburg, Boris

Jan-17-2024–arXiv.org Artificial Intelligence

Text normalization - the conversion of text from written to spoken form - is traditionally assumed to be an ill-formed task for language models. In this work, we argue otherwise. We empirically show the capacity of Large-Language Models (LLM) for text normalization in few-shot scenarios. Combining self-consistency reasoning with linguistic-informed prompt engineering, we find LLM based text normalization to achieve error rates around 40\% lower than top normalization systems. Further, upon error analysis, we note key limitations in the conventional design of text normalization tasks. We create a new taxonomy of text normalization errors and apply it to results from GPT-3.5-Turbo and GPT-4.0. Through this new framework, we can identify strengths and weaknesses of GPT-based TN, opening opportunities for future work.

normalization, text normalization, unrecoverable error, (12 more...)

arXiv.org Artificial Intelligence

Jan-17-2024

arXiv.org PDF

Add feedback

Country:
- Asia > South Korea (0.04)
- North America > United States
  - New York (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe
  - France (0.05)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.51)
    - Performance Analysis > Accuracy (0.34)