cgec model
Loss-Aware Curriculum Learning for Chinese Grammatical Error Correction
Zhang, Ding, Li, Yangning, Bai, Lichen, Zhang, Hao, Li, Yinghui, Lin, Haiye, Zheng, Hai-Tao, Su, Xin, Shan, Zifei
Chinese grammatical error correction (CGEC) aims to detect and correct errors in the input Chinese sentences. Recently, Pre-trained Language Models (PLMS) have been employed to improve the performance. However, current approaches ignore that correction difficulty varies across different instances and treat these samples equally, enhancing the challenge of model learning. To address this problem, we propose a multi-granularity Curriculum Learning (CL) framework. Specifically, we first calculate the correction difficulty of these samples and feed them into the model from easy to hard batch by batch. Then Instance-Level CL is employed to help the model optimize in the appropriate direction automatically by regulating the loss function. Extensive experimental results and comprehensive analyses of various datasets prove the effectiveness of our method.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Guangdong Province > Shenzhen (0.08)
- North America > Dominican Republic (0.04)
- (6 more...)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Data Science > Data Quality > Data Cleaning (0.64)
- Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.64)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)