System Report for CCL24-Eval Task 7: Multi-Error Modeling and Fluency-Targeted Pre-training for Chinese Essay Evaluation

Zhang, Jingshen, Yang, Xiangyu, Su, Xinkai, Chen, Xinglu, Huang, Tianyou, Qiu, Xinying

Jul-11-2024–arXiv.org Artificial Intelligence

This system report presents our approaches and results for the Chinese Essay Fluency Evaluation (CEFE) task at CCL-2024. For Track 1, we optimized predictions for challenging fine-grained error types using binary classification models and trained coarse-grained models on the Chinese Learner 4W corpus. For Track 2, we improved performance by constructing a pseudo-dataset with multiple error types per sentence. For Track 3, where we achieved first place, we generated fluency-rated pseudo-data via back-translation for pre-training and used an NSP-based strategy with Symmetric Cross Entropy loss to capture context and mitigate long-range dependencies. Our methods effectively address key challenges in Chinese Essay Fluency Evaluation.
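As a rough illustration of the Symmetric Cross Entropy loss mentioned for Track 3, the sketch below combines standard cross entropy with a reverse cross-entropy term, following the commonly published formulation of that loss. It is not the authors' implementation: the function names, the three-class setup, and the values of `alpha`, `beta`, and `log_zero` are assumptions chosen for illustration only.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def symmetric_cross_entropy(logits, label, alpha=1.0, beta=1.0,
                            log_zero=-4.0, eps=1e-7):
    """Illustrative SCE loss for one example: alpha * CE + beta * RCE.

    alpha, beta, and log_zero are assumed hyperparameters, not values
    taken from the report.
    """
    probs = [min(max(p, eps), 1.0) for p in softmax(logits)]

    # Standard cross entropy: -log p(label).
    ce = -math.log(probs[label])

    # Reverse cross entropy: -sum_k p_k * log q_k with q the one-hot label.
    # log(1) = 0 for the true class; log(0) is replaced by the constant
    # log_zero for every other class.
    rce = -sum(p * log_zero for k, p in enumerate(probs) if k != label)

    return alpha * ce + beta * rce


if __name__ == "__main__":
    # Hypothetical 3-level fluency rating; the true class is index 0.
    print(symmetric_cross_entropy([2.0, 0.5, -1.0], label=0))
    print(symmetric_cross_entropy([-1.0, 0.5, 2.0], label=0))
```

The reverse term penalizes probability mass placed on non-label classes at a bounded rate, which is why this loss is often used when training labels may be noisy, as can happen with automatically generated pseudo-data.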

Country:
- Asia
  - Singapore (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - China > Guangdong Province
    - Guangzhou (0.05)

Genre:
- Research Report (0.50)

Industry:
- Education > Assessment & Standards > Student Performance (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
