Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets

Barbu, Eduard, Muru, Meeri-Ly, Malva, Sten Marcus

Jan-26-2025–arXiv.org Artificial Intelligence

This study introduces an approach to Estonian text simplification using two model architectures: a neural machine translation model and a fine-tuned large language model (LLaMA). Given the limited resources for Estonian, we developed a new dataset, the Estonian Simplification Dataset, combining translated data and GPT-4.0-generated simplifications. We benchmarked OpenNMT, a neural machine translation model that frames text simplification as a translation task, and fine-tuned the LLaMA model on our dataset to tailor it specifically for Estonian simplification. Manual evaluations on the test set show that the LLaMA model consistently outperforms OpenNMT in readability, grammaticality, and meaning preservation. These findings underscore the potential of large language models for low-resource languages and provide a basis for further research in Estonian text simplification.

large language model, machine learning, simplification, (16 more...)

arXiv.org Artificial Intelligence

Jan-26-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
  - New York > New York County
    - New York City (0.04)
  - California > Los Angeles County
    - Los Angeles (0.14)
- Europe
  - Switzerland (0.04)
  - United Kingdom > England
    - South Yorkshire > Sheffield (0.04)
  - Greece > Central Macedonia
    - Thessaloniki (0.04)
  - Estonia > Tartu County
    - Tartu (0.05)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Bulgaria
    - Varna Province > Varna (0.04)
    - Sofia City Province > Sofia (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia > China
  - Heilongjiang Province > Daqing (0.04)
  - Beijing > Beijing (0.04)

Genre:
- Overview (0.93)
- Research Report > New Finding (0.88)

Industry:
- Education (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Machine Translation (1.00)
    - Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found