Readability Reconsidered: A Cross-Dataset Analysis of Reference-Free Metrics

Belem, Catarina G, Glenn, Parker, Samuel, Alfy, Kumar, Anoop, Liu, Daben

Oct-20-2025–arXiv.org Artificial Intelligence

Automatic readability assessment plays a key role in ensuring effective and accessible written communication. Despite significant progress, the field is hindered by inconsistent definitions of readability and measurements that rely on surface-level text properties. In this work, we investigate the factors shaping human perceptions of readability through the analysis of 897 judgments, finding that, beyond surface-level cues, information content and topic strongly shape text comprehensibility. Furthermore, we evaluate 15 popular readability metrics across five English datasets, contrasting them with six more nuanced, model-based metrics. Our results show that four model-based metrics consistently place among the top four in rank correlations with human judgments, while the best performing traditional metric achieves an average rank of 8.6. These findings highlight a mismatch between current readability metrics and human perceptions, pointing to model-based approaches as a more promising direction.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Oct-20-2025

arXiv.org PDF

Add feedback

Country:
- Africa > Sub-Saharan Africa (0.04)
- Asia
  - China > Beijing
    - Beijing (0.04)
  - Indonesia > Sumatra (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
  - South Korea (0.04)
- Atlantic Ocean > North Atlantic Ocean
  - Hudson Bay (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Bulgaria (0.04)
- North America
  - Canada > Ontario
    - Toronto (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - United States
    - California
      - Orange County > Irvine (0.04)
      - San Diego County > San Diego (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
- Oceania > Australia (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Education
  - Curriculum > Subject-Specific Education (1.00)
  - Educational Setting > K-12 Education (0.95)
- Government (0.68)
- Health & Medicine
  - Consumer Health (0.93)
  - Therapeutic Area > Cardiology/Vascular Diseases (0.68)
- Law (0.68)
- Leisure & Entertainment (0.68)
- Media > Film (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Large Language Model (0.98)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found