CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Sun, Wenju, Li, Qingyong, Geng, Yangli-ao, Li, Boyang
–arXiv.org Artificial Intelligence
Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by accumulating task vectors -- the parameter differences between pretrained and finetuned models. However, task vector accumulation is often hindered by knowledge conflicts, leading to performance degradation. To address this challenge, we propose Conflict-Aware Task Merging (CAT Merging), a novel training-free framework that selectively trims conflict-prone components from the task vectors. CAT Merging introduces several parameter-specific strategies, including projection for linear weights and masking for scaling and shifting parameters in normalization layers. Extensive experiments on vision, language, and vision-language tasks demonstrate that CAT Merging effectively suppresses knowledge conflicts, achieving average accuracy improvements of up to 2.5% (ViT-B/32) and 2.0% (ViT-L/14) over state-of-the-art methods.
arXiv.org Artificial Intelligence
May-15-2025
- Country:
- North America > Canada (0.04)
- Asia
- Genre:
- Research Report > Promising Solution (0.86)
- Technology:
- Information Technology
- Data Science (1.00)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Machine Learning > Neural Networks
- Deep Learning (0.46)
- Information Technology