Data Mixture in Training Un-assures Out-of-Distribution Generalization