5ac8bb8a7d745102a978c5f8ccdb61b8-AuthorFeedback.pdf

Oct-9-2025, 14:10:05 GMT - Neural Information Processing Systems

We thank the reviewers for their extremely helpful suggestions. Below we discuss the most significant points. In each graph, the predictions (in orange) were scaled by a single multiplicative constant to fit the measurements. This constant reflects the length of each gradient step (e.g., due to the learning rate and the size of the training set). We will explain that our results rely on lazy training induced by overparameterization. Chizat et al. provide ... We will discuss this interesting question.
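As an illustration of scaling predictions by a single multiplicative constant, here is a minimal sketch. It assumes the constant is chosen by least squares, which the response does not state; the function name `fit_scale` and the sample numbers are hypothetical and are not taken from the paper.

```python
def fit_scale(predictions, measurements):
    """Return the constant c minimizing sum((c * p - m) ** 2).

    Assumption: a least-squares criterion. The closed form is
    c = <p, m> / <p, p>.
    """
    if len(predictions) != len(measurements):
        raise ValueError("predictions and measurements must have equal length")
    denom = sum(p * p for p in predictions)
    if denom == 0:
        raise ValueError("predictions are all zero; the scale is undefined")
    numer = sum(p * m for p, m in zip(predictions, measurements))
    return numer / denom


# Hypothetical data: measurements are exactly half of the predictions.
predictions = [1.0, 2.0, 4.0]
measurements = [0.5, 1.0, 2.0]

c = fit_scale(predictions, measurements)
scaled = [c * p for p in predictions]  # predictions rescaled to the measurement units
```

Because only one constant is fitted per graph, the shape of the predicted curve is left unchanged; only its overall magnitude is matched to the measurements.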

