We thank all of the reviewers for their thoughtful feedback, and will incorporate their suggestions into the next version

Neural Information Processing Systems 

We thank R1 for their comments and will emphasize the broader implications of our work on model explainability. R2 asked to contrast using (i) influence functions to measure the importance of training points with (ii) existing These papers address a different problem setting from ours and their methods are correspondingly distinct. Despite their differences, these methods could be complementary, as R2 suggested. We will include this discussion and we thank R2 for pointing it out. R3 asked if our empirical findings hold for non-convex models.