Chinese grammatical error correction based on knowledge distillation

Xia, Peng, Zhou, Yuechi, Zhang, Ziyan, Tang, Zecheng, Li, Juntao

Aug-31-2022–arXiv.org Artificial Intelligence

Existing Chinese grammatical error correction models are not robust on attack test sets and have a large number of parameters. To address both problems, this paper uses knowledge distillation to compress the model and improve its resistance to attacks. On the data side, an attack test set is constructed by injecting perturbations into the standard evaluation dataset, and model robustness is evaluated on this attack test set. The experimental results show that the distilled small model maintains performance and trains faster with fewer parameters, achieves the best results on the attack test set, and is significantly more robust. Code is available at https://github.com/Richard88888/KD-CGEC.
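The abstract does not state which distillation objective the authors use. As an illustration only, the following minimal sketch shows the commonly used soft-label formulation of knowledge distillation, in which a small student model is trained on a weighted sum of the usual cross-entropy against the gold token and a KL-divergence term that pulls its temperature-softened distribution toward the teacher's. The function names, the `temperature` and `alpha` parameters, and their default values are assumptions made for this example and are not taken from the paper or its repository.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, target_index,
                      temperature=2.0, alpha=0.5):
    """Hypothetical per-token distillation loss (not the paper's exact objective).

    alpha weights the hard-label term; (1 - alpha) weights the soft-label term.
    """
    # Hard-label term: cross-entropy of the student against the gold token.
    student_probs = softmax(student_logits)
    hard = -math.log(student_probs[target_index])

    # Soft-label term: KL(teacher || student) on temperature-softened outputs.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    soft = sum(t * math.log(t / s)
               for t, s in zip(p_teacher, p_student) if t > 0)

    # Multiplying by temperature**2 keeps the soft term's gradient scale
    # comparable to the hard term as the temperature changes.
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft
```

When the student's logits already match the teacher's, the soft term is zero and only the weighted cross-entropy remains; any disagreement with the teacher adds a positive penalty.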


