Real-Word Error Correction with Trigrams: Correcting Multiple Errors in a Sentence
arXiv.org Artificial Intelligence
Spelling correction is a fundamental task in text mining. In this study, we assess the real-word error correction model proposed by Mays, Damerau and Mercer and describe several drawbacks of the model. We propose a new variation that focuses on detecting and correcting multiple real-word errors in a sentence by manipulating a Probabilistic Context-Free Grammar (PCFG) to discriminate between items in the search space. We test our approach on the Wall Street Journal corpus and show that it outperforms Hirst and Budanitsky's WordNet-based method and Wilcox-O'Hearn, Hirst, and Budanitsky's fixed window size method.
Feb-7-2023
- Country:
- Asia > Middle East
- Iran > Kerman Province > Kerman (0.04)
- Europe
- Italy > Apulia
- Bari (0.04)
- Netherlands > Gelderland
- Nijmegen (0.04)
- North America > United States
- California > Santa Cruz County
- Santa Cruz (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- New York > New York County
- New York City (0.04)
- Genre:
- Research Report > New Finding (0.34)
- Technology: