Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages?
–arXiv.org Artificial Intelligence
While ChatGPT and GPT-based models are able to effectively perform many tasks without additional fine-tuning, they struggle with related to extremely low-resource languages and indigenous languages. Uniform Meaning Representation (UMR), a semantic representation designed to capture the meaning of texts in many languages, is well-poised to be leveraged in the development of low-resource language technologies. In this work, we explore the downstream technical utility of UMR for low-resource languages by incorporating it into GPT-4 prompts. Specifically, we examine the ability of GPT-4 to perform translation from three indigenous languages (Navajo, Ar\'apaho, and Kukama), with and without demonstrations, as well as with and without UMR annotations. Ultimately we find that in the majority of our test cases, integrating UMR into the prompt results in a statistically significant increase in performance, which is a promising indication of future applications of the UMR formalism.
arXiv.org Artificial Intelligence
Feb-12-2025
- Country:
- North America
- Dominican Republic (0.04)
- United States
- District of Columbia > Washington (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy
- Tuscany > Florence (0.04)
- Piedmont > Turin Province
- Turin (0.04)
- France > Grand Est
- Meurthe-et-Moselle > Nancy (0.05)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Spain > Catalonia
- Asia
- Singapore (0.05)
- Middle East > UAE (0.04)
- North America
- Genre:
- Research Report
- Experimental Study (0.67)
- New Finding (0.48)
- Research Report
- Technology: