Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages?

Feb-12-2025–arXiv.org Artificial Intelligence

While ChatGPT and GPT-based models are able to effectively perform many tasks without additional fine-tuning, they struggle with related to extremely low-resource languages and indigenous languages. Uniform Meaning Representation (UMR), a semantic representation designed to capture the meaning of texts in many languages, is well-poised to be leveraged in the development of low-resource language technologies. In this work, we explore the downstream technical utility of UMR for low-resource languages by incorporating it into GPT-4 prompts. Specifically, we examine the ability of GPT-4 to perform translation from three indigenous languages (Navajo, Ar\'apaho, and Kukama), with and without demonstrations, as well as with and without UMR annotations. Ultimately we find that in the majority of our test cases, integrating UMR into the prompt results in a statistically significant increase in performance, which is a promising indication of future applications of the UMR formalism.

large language model, machine learning, translation, (21 more...)

arXiv.org Artificial Intelligence

Feb-12-2025

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - District of Columbia > Washington (0.04)
    - Pennsylvania > Philadelphia County
      - Philadelphia (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Italy
    - Tuscany > Florence (0.04)
    - Piedmont > Turin Province
      - Turin (0.04)
  - France > Grand Est
    - Meurthe-et-Moselle > Nancy (0.05)
  - Bulgaria > Sofia City Province
    - Sofia (0.04)
- Asia
  - Singapore (0.05)
  - Middle East > UAE (0.04)

Genre:
- Research Report
  - Experimental Study (0.67)
  - New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found