CONGRAD: Conflicting Gradient Filtering for Multilingual Preference Alignment

Li, Jiangnan, Vu, Thuy-Trang, Herold, Christian, Tebbifakhr, Amirhossein, Khadivi, Shahram, Haffari, Gholamreza

Mar-31-2025–arXiv.org Artificial Intelligence

Naive joint training of large language models (LLMs) for multilingual preference alignment can suffer from negative interference, a known issue in multilingual training in which conflicting objectives degrade overall performance. The impact of this phenomenon on multilingual preference alignment, however, remains largely underexplored. To address it, we propose CONGRAD, a scalable and effective filtering method that selects high-quality preference samples with minimal gradient conflicts across languages. Our method leverages gradient surgery to retain samples that are aligned with an aggregated multilingual update direction. We also incorporate a sublinear gradient compression strategy that reduces memory overhead during gradient accumulation. We integrate CONGRAD into a self-rewarding framework and evaluate it on LLaMA3-8B and Gemma2-2B across 10 languages. Results show that CONGRAD consistently outperforms strong baselines on both seen and unseen languages, with minimal alignment tax.
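The filtering idea in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the abstract does not specify the scoring rule, the form of gradient surgery, or the compression step, so the sketch assumes a PCGrad-style projection between per-language mean gradients, scores each sample by the dot product of its gradient with the aggregated direction, and omits compression entirely. The names `pcgrad_aggregate`, `filter_samples`, and `keep_ratio` are hypothetical, and the gradients are toy 2-D vectors rather than real model gradients.

```python
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def pcgrad_aggregate(lang_grads, seed=0):
    """Assumed PCGrad-style surgery: for each language gradient, remove the
    component that conflicts (negative dot product) with every other
    language's original gradient, then sum the projected gradients."""
    rng = random.Random(seed)
    names = sorted(lang_grads)
    projected = []
    for name in names:
        g = list(lang_grads[name])
        others = [n for n in names if n != name]
        rng.shuffle(others)  # PCGrad visits the other tasks in random order
        for other in others:
            h = lang_grads[other]
            overlap = dot(g, h)
            if overlap < 0:  # conflict: project g onto the normal plane of h
                scale = overlap / dot(h, h)
                g = [a - scale * b for a, b in zip(g, h)]
        projected.append(g)
    return [sum(col) for col in zip(*projected)]


def filter_samples(sample_grads, sample_langs, keep_ratio=0.5):
    """Keep the samples whose gradients agree most with the aggregated
    multilingual direction (hypothetical scoring: plain dot product)."""
    by_lang = {}
    for g, lang in zip(sample_grads, sample_langs):
        by_lang.setdefault(lang, []).append(g)
    lang_grads = {lang: mean_vector(gs) for lang, gs in by_lang.items()}
    direction = pcgrad_aggregate(lang_grads)
    scores = [dot(g, direction) for g in sample_grads]
    k = max(1, int(len(scores) * keep_ratio))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k]), scores


if __name__ == "__main__":
    # Toy per-sample gradients for two languages (illustrative values only).
    grads = [[1.0, 0.0], [1.0, 1.0], [-2.0, 1.0], [0.0, 1.0]]
    langs = ["en", "en", "de", "de"]
    kept, scores = filter_samples(grads, langs, keep_ratio=0.5)
    print("kept sample indices:", kept)
    print("alignment scores:", scores)
```

In this toy setup the two language-level gradients conflict along the first axis, so the projection leaves an aggregated direction dominated by the second axis, and the samples with the largest component along that axis are retained. A real pipeline would obtain per-sample gradients from the preference loss of the model and would need some form of compression to store them, which this sketch does not attempt.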

computational linguistic, large language model, machine learning

Genre:
- Research Report > New Finding (0.66)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
