Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
–Neural Information Processing Systems
We revisit data selection in a modern context of finetuning from a fundamental perspective. Extending the classical wisdom of variance minimization in low dimensions to high-dimensional finetuning, our generalization analysis unveils the importance of additionally reducing bias induced by low-rank approximation. Inspired by the variance-bias tradeoff in high dimensions from the theory, we introduce Sketchy Moment Matching (SkMM), a scalable data selection scheme with two stages. Theoretically, we show that gradient sketching is fast and provably accurate: selecting n samples by reducing variance over \mathcal{S} preserves the fast-rate generalization O(\dim(\mathcal{S})/n), independent of the parameter dimension. Empirically, we concretize the variance-bias balance via synthetic experiments and demonstrate the effectiveness of SkMM for finetuning in real vision tasks.
Neural Information Processing Systems
May-27-2025, 00:41:31 GMT
- Technology: