Efficient Sketches for Training Data Attribution and Studying the Loss Landscape
Neural Information Processing Systems
Traditional sketching methods struggle to scale under the memory constraints that arise when gradients and Hessian-vector products (HVPs) must be stored for modern models. We present a novel framework for scalable gradient and HVP sketching, tailored for modern hardware. We provide theoretical guarantees and demonstrate the power of our methods in applications such as training data attribution, Hessian spectrum analysis, and intrinsic dimension computation for pre-trained language models.
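As a rough illustration of why sketching helps with attribution-style computations, the snippet below projects two gradient-like vectors to a much lower dimension and compares their inner product before and after. It is a minimal sketch under stated assumptions: it uses a generic dense Gaussian random projection, not the framework proposed in the paper (whose algorithms are not described in this abstract), and the names `gaussian_sketch`, `g_train`, and `g_test` are hypothetical.

```python
import math
import random


def gaussian_sketch(vec, k, seed=0):
    """Project `vec` (length d) to k dimensions with a seeded Gaussian matrix.

    Rows are regenerated from the seed on every call, so the k x d matrix is
    never stored, and the same seed always yields the same projection.
    This is a generic random projection, NOT the paper's method.
    """
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(k)  # makes E[<Sx, Sy>] equal to <x, y>
    out = []
    for _ in range(k):
        row_dot = 0.0
        for v in vec:
            row_dot += rng.gauss(0.0, 1.0) * v
        out.append(scale * row_dot)
    return out


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def unit(v):
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


# Synthetic stand-ins for a training-example gradient and a test gradient.
data_rng = random.Random(123)
d, k = 1000, 512
g_train = unit([data_rng.gauss(0.0, 1.0) for _ in range(d)])
g_test = unit([a + 0.02 * data_rng.gauss(0.0, 1.0) for a in g_train])

# Both vectors must be sketched with the same seed (same projection).
s_train = gaussian_sketch(g_train, k, seed=7)
s_test = gaussian_sketch(g_test, k, seed=7)

exact = dot(g_train, g_test)
approx = dot(s_train, s_test)
print(f"exact inner product:    {exact:.3f}")
print(f"sketched inner product: {approx:.3f}")
```

Only the k-dimensional sketches need to be kept per example, so storage drops from d to k numbers per gradient. The estimate is unbiased, with error shrinking roughly as 1/sqrt(k). The dense projection here still costs O(kd) time per vector, which is the kind of cost that motivates more scalable sketching schemes.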
May-29-2025, 07:34:23 GMT
- Country:
- Europe > Netherlands (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.67)
- Industry:
- Information Technology (0.46)
- Technology: