LengClaro2023: A Dataset of Administrative Texts in Spanish with Plain Language adaptations
Agüera-Marco, Belén, Gonzalez-Dios, Itziar
–arXiv.org Artificial Intelligence
In this work, we present LengClaro2023, a dataset of legal-administrative texts in Spanish. Based on the most frequently used procedures from the Spanish Social Security website, we have created for each text two simplified equivalents. The first version follows the recommendations provided by arText claro. The second version incorporates additional recommendations from plain language guidelines to explore further potential improvements in the system. The linguistic resource created in this work can be used for evaluating automatic text simplification (ATS) systems in Spanish.
arXiv.org Artificial Intelligence
Jun-9-2025
- Country:
- Asia > Middle East
- UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Europe
- North America > United States
- California > Los Angeles County
- El Segundo (0.04)
- Colorado (0.04)
- California > Los Angeles County
- South America > Argentina (0.04)
- Asia > Middle East
- Genre:
- Overview (0.67)
- Research Report (0.63)
- Industry:
- Government (1.00)
- Health & Medicine > Therapeutic Area (0.46)
- Law (1.00)
- Technology: