An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

Mar-29-2024–arXiv.org Artificial Intelligence

The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer is well established. However, phenomena of positive or negative transfer, and the effect of language choice still need to be fully understood, especially in the complex setting of massively multilingual LMs. We propose an efficient method to study transfer language influence in zero-shot performance on another target language. Unlike previous work, our approach disentangles downstream tasks from language, using dedicated adapter units. Our findings suggest that some languages do not largely affect others, while some languages, especially ones unseen during pre-training, can be extremely beneficial or detrimental for different target languages. We find that no transfer language is beneficial for all target languages. We do, curiously, observe languages previously unseen by MLMs consistently benefit from Figure 1: Our approach uses efficient few-step continued transfer from almost any language. We additionally tuning (left) and adapter modules (right) to disentangle use our modular approach to quantify the effect of task and language to quantify the effect negative interference efficiently and catagorize of a transfer language for a given task and model.

computational linguistic, target language, transfer language, (15 more...)

arXiv.org Artificial Intelligence

Mar-29-2024

arXiv.org PDF

Add feedback

Country:
- Africa > Niger (0.05)
- South America
  - Brazil (0.04)
  - Peru > Cusco Department
    - Cusco Province > Cusco (0.04)
- North America
  - United States
    - Washington > King County
      - Seattle (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Russia (0.04)
  - Belgium (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Hungary > Csongrád-Csanád County
    - Szeged (0.04)
- Asia
  - Russia (0.04)
  - Philippines (0.04)
  - Middle East > Jordan (0.04)
  - Japan > Honshū
    - Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report > New Finding (0.86)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language
    - Text Processing (0.46)
    - Large Language Model (0.44)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found