AITopics | train size 2



Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

Nayak, Shravan, Ranathunga, Surangika, Thillainathan, Sarubi, Hung, Rikki, Rinaldi, Anthony, Wang, Yining, Mackey, Jonah, Ho, Andrew, Lee, En-Shiun Annie

arXiv.org Artificial Intelligence, Sep-23-2023

NMT systems trained on Pre-trained Multilingual Sequence-Sequence (PMSS) models flounder when sufficient parallel data is not available for fine-tuning. This holds in particular for languages that are missing from or under-represented in these models, and the problem is aggravated when the data comes from different domains. In this paper, we show that intermediate-task fine-tuning (ITFT) of PMSS models is extremely beneficial for domain-specific NMT, especially when target-domain data is limited or unavailable and the languages considered are missing or under-represented in the PMSS model. We quantify the domain-specific variation in results using a domain-divergence test, and show that ITFT can mitigate the impact of domain divergence to some extent.

PMSS models such as mBART (Tang et al., 2021) and mT5 (Xue et al., 2021) have shown considerable promise over vanilla Transformer models for Neural Machine Translation (NMT).
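The abstract does not say which domain-divergence test is used. As a rough illustration only, the sketch below assumes one common choice: Jensen-Shannon divergence between the unigram distributions of two corpora. The function names and the toy sentences are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def unigram_distribution(sentences):
    """Return a word -> relative frequency map for whitespace-tokenised text.

    Assumes at least one non-empty sentence; empty input is not handled.
    """
    counts = Counter(token for s in sentences for token in s.lower().split())
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (log base 2) between two word distributions.

    Returns 0.0 for identical distributions and 1.0 for disjoint vocabularies.
    """
    divergence = 0.0
    for word in set(p) | set(q):
        pw, qw = p.get(word, 0.0), q.get(word, 0.0)
        mw = 0.5 * (pw + qw)  # mixture of the two distributions
        if pw > 0:
            divergence += 0.5 * pw * math.log2(pw / mw)
        if qw > 0:
            divergence += 0.5 * qw * math.log2(qw / mw)
    return divergence


# Hypothetical toy corpora standing in for an auxiliary and a target domain.
auxiliary_domain = ["the minister signed the bill", "the bill passed the vote"]
target_domain = ["the patient took the medicine", "the doctor signed the form"]

score = js_divergence(
    unigram_distribution(auxiliary_domain),
    unigram_distribution(target_domain),
)
```

Under this assumed measure, a score near 0 means the two corpora share most of their vocabulary mass, while a score near 1 means they barely overlap. A real comparison would use proper tokenisation and much larger samples.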

computational linguistic, language pair, train size 2, (13 more...)

arXiv.org Artificial Intelligence

2306.01382

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > India (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(10 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
