TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Open in new window