Data Efficacy for Language Model Training