An Extended Sequence Tagging Vocabulary for Grammatical Error Correction
Mesham, Stuart, Bryant, Christopher, Rei, Marek, Yuan, Zheng
–arXiv.org Artificial Intelligence
We extend a current sequence-tagging approach to Grammatical Error Correction (GEC) by introducing specialised tags for spelling correction and morphological inflection using the SymSpell and LemmInflect algorithms. Our approach improves generalisation: the proposed new tagset allows a smaller number of tags to correct a larger range of errors. Our results show a performance improvement both overall and in the targeted error categories. We further show that ensembles trained with our new tagset outperform those trained with the baseline tagset on the public BEA benchmark.
arXiv.org Artificial Intelligence
Feb-12-2023
- Country:
- North America
- United States
- Maryland > Baltimore (0.04)
- Washington > King County
- Seattle (0.14)
- Oregon > Multnomah County
- Portland (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California > San Diego County
- San Diego (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- Asia
- South Korea (0.04)
- China > Hong Kong (0.04)
- Thailand > Chiang Mai
- Chiang Mai (0.04)
- North America
- Genre:
- Research Report > New Finding (1.00)
- Technology: