Mitigating Catastrophic Forgetting in Language Transfer via Model Merging