Progressive Multi-task Learning Framework for Chinese Text Error Correction
Ma, Shirong, Li, Yinghui, Huang, Haojing, Huang, Shulin, Li, Yangning, Zheng, Hai-Tao, Shen, Ying
arXiv.org Artificial Intelligence
Chinese Text Error Correction (CTEC) aims to detect and correct errors in input text, which benefits people's daily lives and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to address the CTEC task and have achieved tremendous success. However, previous approaches suffer from both over-correction and under-correction, and the former is especially conspicuous in the precision-critical CTEC task. To mitigate over-correction, we propose a novel model-agnostic progressive multi-task learning framework for CTEC, named ProTEC, which guides a CTEC model to learn the task from easy to difficult. We divide the CTEC task into three sub-tasks, ordered from easy to difficult: Error Detection, Error Type Identification, and Correction Result Generation. During training, ProTEC guides the model to learn text error correction progressively by incorporating these sub-tasks into a multi-task training objective. During inference, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of the proposed framework.
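The easy-to-difficult decomposition described in the abstract can be illustrated with a minimal sketch. The abstract gives neither the exact training objective nor the model architecture, so everything below is an assumption made for illustration: the weighted-sum form of `multitask_loss`, the function names, and the toy confusion table that stands in for a trained model. The sketch only shows why running detection first can limit over-correction: tokens not flagged in the first stage are copied through unchanged.

```python
from typing import Callable, List, Sequence

def multitask_loss(det: float, typ: float, gen: float,
                   weights: Sequence[float] = (1.0, 1.0, 1.0)) -> float:
    """Hypothetical joint objective: a weighted sum of the three sub-task losses."""
    w_det, w_typ, w_gen = weights
    return w_det * det + w_typ * typ + w_gen * gen

def progressive_correct(
    tokens: List[str],
    detect: Callable[[List[str]], List[bool]],
    identify_type: Callable[[List[str], int], str],
    generate: Callable[[List[str], int, str], str],
) -> List[str]:
    """Run the three sub-tasks in turn, from easy to difficult."""
    flags = detect(tokens)                          # 1. Error Detection
    output = list(tokens)
    for i, flagged in enumerate(flags):
        if not flagged:
            continue                                # unflagged tokens are never rewritten
        err_type = identify_type(tokens, i)         # 2. Error Type Identification
        output[i] = generate(tokens, i, err_type)   # 3. Correction Result Generation
    return output

# Toy stand-ins for a trained model (hypothetical; a real system would use a PLM).
CONFUSION = {"平": "苹"}

def toy_detect(tokens: List[str]) -> List[bool]:
    return [t in CONFUSION for t in tokens]

def toy_identify_type(tokens: List[str], i: int) -> str:
    return "substitution"

def toy_generate(tokens: List[str], i: int, err_type: str) -> str:
    return CONFUSION[tokens[i]] if err_type == "substitution" else tokens[i]

corrected = progressive_correct(list("我喜欢吃平果"),
                                toy_detect, toy_identify_type, toy_generate)
print("".join(corrected))
```

In this toy run only the one flagged character reaches the generation stage; every other character is passed through untouched, which is the behaviour a precision-critical correction task favours.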
Jul-3-2023
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- China
- Beijing > Beijing (0.04)
- Guangdong Province
- Inner Mongolia > Hohhot (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Taiwan (0.04)
- Europe
- France > Île-de-France
- Hauts-de-Seine > Nanterre (0.04)
- Paris > Paris (0.04)
- United Kingdom > England
- West Midlands > Wolverhampton (0.04)
- North America > United States (0.04)
- Genre:
- Research Report
- New Finding (0.46)
- Promising Solution (0.48)
- Industry:
- Education > Educational Setting > Higher Education (0.69)
- Technology: