Training Data Attribution via Approximate Unrolling

Apr-29-2026, 01:02:08 GMT–Neural Information Processing Systems

Many training data attribution (TDA) methods aim to estimate how a model's behavior would change if one or more data points were removed from the training set. Methods based on implicit differentiation, such as influence functions, can be made computationally efficient, but fail to account for underspecification, the implicit bias of the optimization algorithm, or multi-stage training pipelines. By contrast, methods based on unrolling address these issues but face scalability challenges.
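The contrast between the two families can be illustrated on a toy problem. The sketch below is not the paper's algorithm: it assumes a one-dimensional ridge regression trained by full-batch gradient descent, and every name in it (`fit`, `unrolled_delta`, `influence_delta`, the learning rate, the step counts) is hypothetical and chosen only for illustration. It compares three estimates of how the fitted parameter changes when one training point is removed: exact retraining, an influence-function estimate built from the Hessian at the optimum, and a first-order estimate obtained by differentiating through the unrolled training steps.

```python
import random

random.seed(0)
n, lam = 50, 0.1  # assumed dataset size and L2 penalty
xs = [random.gauss(0.0, 1.0) for _ in range(n)]
ys = [2.0 * x + random.gauss(0.0, 0.1) for x in xs]

def fit(weights):
    """Closed-form minimizer of
    (1/n) * sum_j w_j * 0.5 * (x_j * theta - y_j)**2 + 0.5 * lam * theta**2."""
    sxy = sum(w * x * y for w, x, y in zip(weights, xs, ys))
    sxx = sum(w * x * x for w, x in zip(weights, xs))
    return sxy / (sxx + n * lam)

i = 0                       # index of the training point to remove
full = [1.0] * n
loo = list(full)
loo[i] = 0.0                # leave-one-out weighting

# Ground truth: retrain without point i and compare parameters.
theta_star = fit(full)
exact_delta = fit(loo) - theta_star

# Influence function: one Newton-style correction at the optimum,
# using the Hessian of the full objective (a scalar in one dimension).
hessian = sum(x * x for x in xs) / n + lam
grad_i = xs[i] * (xs[i] * theta_star - ys[i])
influence_delta = grad_i / (n * hessian)

def unrolled_delta(steps, lr=0.1):
    """First-order removal estimate obtained by differentiating the
    gradient-descent trajectory with respect to the weight of point i."""
    theta, dtheta_dw = 0.0, 0.0
    for _ in range(steps):
        g_i = xs[i] * (xs[i] * theta - ys[i])
        grad = sum(x * (x * theta - y) for x, y in zip(xs, ys)) / n + lam * theta
        # Chain rule through one update theta <- theta - lr * grad.
        dtheta_dw = dtheta_dw - lr * (hessian * dtheta_dw + g_i / n)
        theta = theta - lr * grad
    # Removing the point moves its weight from 1 to 0.
    return -dtheta_dw

print("exact retraining    :", exact_delta)
print("influence function  :", influence_delta)
print("unrolled, 500 steps :", unrolled_delta(500))
print("unrolled, 5 steps   :", unrolled_delta(5))
```

In this convex setting the unrolled estimate approaches the influence-function estimate as training converges, and both differ from exact retraining only by a leverage factor. With few training steps the unrolled estimate reflects the trajectory actually taken and can differ from the influence-function value, which is the kind of optimizer dependence that implicit-differentiation methods do not capture. Note that the unrolled pass carries a derivative through every step, which hints at why unrolling becomes expensive at scale.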

artificial intelligence, machine learning, proceedings

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.82)