MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

Yaming Yang, Dilxat Muhtar, Yelong Shen, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Denvy Deng, Feng Sun, Qi Zhang, Weizhu Chen, Yunhai Tong

Oct-15-2024–arXiv.org Artificial Intelligence

Parameter-efficient fine-tuning (PEFT) has been widely employed for domain adaptation, with LoRA being one of the most prominent methods due to its simplicity and effectiveness. However, in multi-task learning (MTL) scenarios, LoRA tends to obscure the distinction between tasks by projecting sparse high-dimensional features from different tasks into the same dense low-dimensional intrinsic space. This leads to task interference and suboptimal performance for LoRA and its variants. To tackle this challenge, we propose MTL-LoRA, which retains the advantages of low-rank adaptation while significantly enhancing multi-task learning capabilities. MTL-LoRA augments LoRA with additional task-adaptive parameters that differentiate task-specific information and capture shared knowledge across tasks within low-dimensional spaces. This approach enables large language models (LLMs) pre-trained on a general corpus to adapt to different target task domains with a limited number of trainable parameters. Comprehensive experiments on public academic benchmarks for natural language understanding, commonsense reasoning, and image-text understanding, as well as on real-world industrial text Ads relevance datasets, demonstrate that MTL-LoRA outperforms LoRA and its variants in multi-task learning with comparable or even fewer learnable parameters.
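
To make the idea of task-adaptive parameters inside the low-dimensional space more concrete, the following is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact formulation, so the per-task rank-by-rank matrices (`task_mix`), the layer sizes, and the function names are all hypothetical. It contrasts plain LoRA, where every task shares one low-rank path, with a variant that inserts a small task-specific transform between the shared down- and up-projections.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumed, not taken from the paper).
d_in, d_out, rank, num_tasks = 16, 12, 4, 3

# Frozen pre-trained weight; it is never updated during adaptation.
W0 = rng.normal(size=(d_out, d_in))

# Standard LoRA factors shared by all tasks: delta_W = B @ A.
A = rng.normal(scale=0.01, size=(rank, d_in))
B = np.zeros((d_out, rank))  # zero init, so the adapter starts as a no-op

# Hypothetical task-adaptive parameters: one small (rank x rank) matrix per
# task, initialised to the identity so every task starts from plain LoRA.
task_mix = np.stack([np.eye(rank) for _ in range(num_tasks)])


def lora_forward(x):
    """Plain LoRA: every task goes through the same low-rank path."""
    return W0 @ x + B @ (A @ x)


def task_adaptive_forward(x, task_id):
    """Shared projections with a per-task transform in the low-rank space."""
    z = A @ x                    # project into the shared low-rank space
    z = task_mix[task_id] @ z    # task-specific transform inside that space
    return W0 @ x + B @ z        # project back and add to the frozen output


def extra_params_per_task():
    """Parameters this sketch adds per task on top of plain LoRA."""
    return rank * rank
```

In this sketch each task adds only `rank * rank` parameters, which stays small compared with the shared factors `A` and `B` whenever the rank is much smaller than the layer width. Whether the paper uses this exact parameterization is not stated in the abstract.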

large language model, machine learning, natural language

Country:
- Africa > Seychelles (0.04)
- Europe
  - France (0.04)
  - Romania > Sud - Muntenia Development Region
    - Giurgiu County > Giurgiu (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia > China
  - Jiangsu Province > Nanjing (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine (1.00)
- Information Technology (0.68)
- Banking & Finance > Trading (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.92)
  - Machine Learning > Neural Networks
    - Deep Learning (0.49)