Decouple-Then-Merge: Towards Better Training for Diffusion Models