Beyond Effi ciency: Molecular Data Pruning for Enhanced Generalization