CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

Sun, Wenju, Li, Qingyong, Geng, Yangli-ao, Li, Boyang

May-15-2025–arXiv.org Artificial Intelligence

Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by accumulating task vectors -- the parameter differences between pretrained and finetuned models. However, task vector accumulation is often hindered by knowledge conflicts, leading to performance degradation. To address this challenge, we propose Conflict-Aware Task Merging (CAT Merging), a novel training-free framework that selectively trims conflict-prone components from the task vectors. CAT Merging introduces several parameter-specific strategies, including projection for linear weights and masking for scaling and shifting parameters in normalization layers. Extensive experiments on vision, language, and vision-language tasks demonstrate that CAT Merging effectively suppresses knowledge conflicts, achieving average accuracy improvements of up to 2.5% (ViT-B/32) and 2.0% (ViT-L/14) over state-of-the-art methods.

artificial intelligence, machine learning, natural language, (13 more...)

arXiv.org Artificial Intelligence

May-15-2025

arXiv.org PDF

Add feedback

Country:
- North America > Canada (0.04)
- Asia
  - Singapore (0.04)
  - China
    - Beijing > Beijing (0.04)
    - Guangdong Province > Shenzhen (0.04)

Genre:
- Research Report > Promising Solution (0.86)

Technology:
- Information Technology
  - Data Science (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found