ChaoticRegularizationand Heavy-Tailed Limitsfor DeterministicGradientDescent
–Neural Information Processing Systems
However, to obtain the desired effect, the step-size should be chosen sufficiently large, a task which is problem dependent andcanbedifficultinpractice.
Neural Information Processing Systems
Feb-11-2026, 06:27:18 GMT