Stepping on the Edge: Curvature A ware Learning Rate Tuners
–Neural Information Processing Systems
(Liu and Nocedal, 1989). Similar efforts have been made for Polyak stepsizes (Berrada et al., 2020; Loizou et al., 2021), in addition to new methods which combine distance to optimality with online learning convergence bounds (Cutkosky et al., 2023; Classically-inspired methods, however, have generally struggled to gain traction in deep learning.
Neural Information Processing Systems
Feb-13-2026, 14:36:34 GMT
- Country:
- Europe > Russia (0.04)
- North America > Canada
- Asia
- Russia (0.04)
- Middle East > Jordan (0.04)
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Education > Educational Setting > Online (0.48)
- Technology: