BLISS: A Lightweight Bilevel Influence Scoring Method for Data Selection in Language Model Pretraining
