Readability Reconsidered: A Cross-Dataset Analysis of Reference-Free Metrics
Belem, Catarina G, Glenn, Parker, Samuel, Alfy, Kumar, Anoop, Liu, Daben
–arXiv.org Artificial Intelligence
Automatic readability assessment plays a key role in ensuring effective and accessible written communication. Despite significant progress, the field is hindered by inconsistent definitions of readability and measurements that rely on surface-level text properties. In this work, we investigate the factors shaping human perceptions of readability through the analysis of 897 judgments, finding that, beyond surface-level cues, information content and topic strongly shape text comprehensibility. Furthermore, we evaluate 15 popular readability metrics across five English datasets, contrasting them with six more nuanced, model-based metrics. Our results show that four model-based metrics consistently place among the top four in rank correlations with human judgments, while the best performing traditional metric achieves an average rank of 8.6. These findings highlight a mismatch between current readability metrics and human perceptions, pointing to model-based approaches as a more promising direction.
arXiv.org Artificial Intelligence
Oct-20-2025
- Country:
- Asia (1.00)
- North America > United States
- California (0.28)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Media > Film (0.68)
- Leisure & Entertainment (0.68)
- Law (0.68)
- Government (0.68)
- Health & Medicine
- Consumer Health (0.93)
- Therapeutic Area > Cardiology/Vascular Diseases (0.68)
- Education
- Curriculum > Subject-Specific Education (1.00)
- Educational Setting > K-12 Education (0.95)
- Technology: