Generalization Bounds for Gradient Methods via Discrete and Continuous Prior

Neural Information Processing Systems 

Proving algorithm-dependent generalization error bounds for gradient-type optimization methods has recently attracted significant attention in learning theory. However, most existing trajectory-based analyses require either restrictive assumptions on the learning rate (e.g., a fast-decreasing learning rate) or continuous injected
