Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated Learning
–Neural Information Processing Systems
Federated learning (FL) is a learning paradigm that enables collaborative training of models using decentralized data. Recently, the utilization of pre-trained weight initialization in FL has been demonstrated to effectively improve model performance. However, the evolving complexity of current pre-trained models, characterized by a substantial increase in parameters, markedly intensifies the challenges associated with communication rounds required for their adaptation to FL. To address these communication cost issues and increase the performance of pre-trained model adaptation in FL, we propose an innovative model interpolation-based local training technique called "Local Superior Soups." Our method enhances local training across different clients, encouraging the exploration of a connected low-loss basin within a few communication rounds through regularized model interpolation.
Neural Information Processing Systems
May-28-2025, 19:24:42 GMT
- Country:
- North America
- Canada (0.14)
- United States (0.14)
- North America
- Genre:
- Research Report
- Experimental Study (1.00)
- Promising Solution (0.66)
- Research Report
- Industry:
- Education (0.55)
- Information Technology (0.67)
- Materials > Chemicals
- Specialty Chemicals (0.40)
- Technology: