Reviews: Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers
–Neural Information Processing Systems
I thank the authors for their response. I understand that generalization is not the major contribution in this paper -- thanks for the note. I also appreciate the plot showing the numerical values of the weight norms for varying width. It is reassuring to know that these quantities do vary inversely with width for this setting. I think adding these sorts of plots to the appendix of the paper (with a bit more detailed experimentation and discussion) would be useful for the paper.
Neural Information Processing Systems
Jan-24-2025, 05:59:54 GMT
- Technology: