A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters