Redefining Simplicity: Benchmarking Large Language Models from Lexical to Document Simplification

Qiang, Jipeng, Huang, Minjiang, Zhu, Yi, Yuan, Yunhao, Zhang, Chaowei, Yu, Kui

Feb-12-2025–arXiv.org Artificial Intelligence

Text simplification (TS) refers to the process of reducing the complexity of a text while retaining its original meaning and key information. Existing work only shows that large language models (LLMs) have outperformed supervised non-LLM-based methods on sentence simplification. This study offers the first comprehensive analysis of LLM performance across four TS tasks: lexical, syntactic, sentence, and document simplification. We compare lightweight, closed-source and open-source LLMs against traditional non-LLM methods using automatic metrics and human evaluations. Our experiments reveal that LLMs not only outperform non-LLM approaches in all four tasks but also often generate outputs that exceed the quality of existing human-annotated references. Finally, we present some future directions of TS in the era of LLMs.

large language model, machine learning, simplification, (17 more...)

arXiv.org Artificial Intelligence

Feb-12-2025

arXiv.org PDF

Add feedback

Country:
- Asia
  - China (0.14)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.14)
- Europe
  - Denmark (0.14)
  - France (0.14)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Education (0.68)
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.51)
  - Natural Language > Large Language Model (1.00)