Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

Open in new window