Stability and Generalization Analysis of Gradient Methods for Shallow Neural Networks Yunwen Lei

Aug-19-2025, 21:46:28 GMT–Neural Information Processing Systems

While significant theoretical progress has been achieved, unveiling the generalization mystery of overparameterized neural networks still remains largely elusive. In this paper, we study the generalization behavior of shallow neural networks (SNNs) by leveraging the concept of algorithmic stability. We consider gradient descent (GD) and stochastic gradient descent (SGD) to train SNNs, for both of which we develop consistent excess risk bounds by balancing the optimization and generalization via early-stopping.

artificial intelligence, generalization, machine learning, (13 more...)

Neural Information Processing Systems

Aug-19-2025, 21:46:28 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > China
  - Hong Kong > Kowloon (0.04)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Statistical Learning > Gradient Descent (0.78)

Duplicate Docs Excel Report

Title
fb8fe6b79288f3d83696a5d276f4fc9d-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found