LLM-Based Evaluation of Low-Resource Machine Translation: A Reference-less Dialect Guided Approach with a Refined Sylheti-English Benchmark

Rahman, Md. Atiqur, Islam, Sabrina, Omi, Mushfiqul Haque

May-20-2025–arXiv.org Artificial Intelligence

Evaluating machine translation (MT) for low-resource languages poses a persistent challenge, primarily due to the limited availability of high-quality reference translations. This issue is further exacerbated in languages with multiple dialects, where linguistic diversity and data scarcity hinder robust evaluation. Large Language Models (LLMs) present a promising solution through reference-free evaluation techniques; however, their effectiveness diminishes in the absence of dialect-specific context and tailored guidance. In this work, we propose a comprehensive framework that enhances LLM-based MT evaluation using a dialect-guided approach. We extend the ONUBAD dataset by incorporating Sylheti-English sentence pairs, corresponding machine translations, and Direct Assessment (DA) scores annotated by native speakers. To address the vocabulary gap, we augment the tokenizer vocabulary with dialect-specific terms. We further introduce a regression head to enable scalar score prediction and design a dialect-guided (DG) prompting strategy. Our evaluation across multiple LLMs shows that the proposed pipeline consistently outperforms existing methods, achieving the highest gain of +0.1083 in Spearman correlation, along with improvements across other evaluation settings. The dataset and the code are available at https://github.com/180041123-Atiq/MTEonLowResourceLanguage .
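
The abstract reports its gains as Spearman correlation between model-predicted scores and human DA scores. As a minimal sketch of what that metric measures, the following pure-Python example ranks both score lists and correlates the ranks. The function names and the sample scores are hypothetical and chosen only for illustration; they are not taken from the paper or its repository.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    std_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical scores for five translations (illustrative values only).
human_da = [78.0, 35.0, 91.0, 52.0, 64.0]      # human Direct Assessment
model_scores = [70.2, 41.5, 88.0, 60.1, 55.3]  # scalar model predictions

rho = spearman(human_da, model_scores)
print(f"Spearman correlation: {rho:.4f}")
```

Because only the ordering of the scores matters, a model can score well on this metric even if its predictions are on a different scale from the human DA scores.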
