Time-Reversed Dissipation Induces Duality Between Minimizing Gradient Norm and Function Value
–Neural Information Processing Systems
In convex optimization, first-order optimization methods efficiently minimizing function values have been a central subject study since Nesterov's seminal work of 1983. Recently, however, Kim and Fessler's OGM-G and Lee et al.'s FISTA-G have been presented as alternatives that efficiently minimize the gradient magnitude instead. In this paper, we present H-duality, which represents a surprising one-to-one correspondence between methods efficiently minimizing function values and methods efficiently minimizing gradient magnitude.
Neural Information Processing Systems
Dec-25-2025, 00:47:05 GMT
- Technology: