MuRating: AHigh Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Open in new window