Scalable Model Merging with Progressive Layer-wise Distillation