BLISS: A Lightweight Bilevel Influence Scoring Method for Data Selection in Language Model Pretraining