Enhancing Stability for Large Models Training in Constrained Bandwidth Networks