Self-Influence Guided Data Reweighting for Language Model Pre-training