ghbnode
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- Asia > Middle East > Jordan (0.04)
- (5 more...)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- Asia > Middle East > Jordan (0.04)
- (5 more...)
AdamNODEs: When Neural ODE Meets Adaptive Moment Estimation
Cho, Suneghyeon, Hong, Sanghyun, Lee, Kookjin, Park, Noseong
Recent work by Xia et al. leveraged the continuous-limit of the classical momentum accelerated gradient descent and proposed heavy-ball neural ODEs. While this model offers computational efficiency and high utility over vanilla neural ODEs, this approach often causes the overshooting of internal dynamics, leading to unstable training of a model. Prior work addresses this issue by using ad-hoc approaches, e.g., bounding the internal dynamics using specific activation functions, but the resulting models do not satisfy the exact heavy-ball ODE. In this work, we propose adaptive momentum estimation neural ODEs (AdamNODEs) that adaptively control the acceleration of the classical momentum-based approach. We find that its adjoint states also satisfy AdamODE and do not require ad-hoc solutions that the prior work employs. In evaluation, we show that AdamNODEs achieve the lowest training loss and efficacy over existing neural ODEs. We also show that AdamNODEs have better training stability than classical momentum-based neural ODEs. This result sheds some light on adapting the techniques proposed in the optimization community to improving the training and inference of neural ODEs further.
- Europe > Russia (0.05)
- Asia > Russia (0.05)
- North America > United States > Oregon (0.04)
- (5 more...)
Heavy Ball Neural Ordinary Differential Equations
Xia, Hedi, Suliafu, Vai, Ji, Hangjie, Nguyen, Tan M., Bertozzi, Andrea L., Osher, Stanley J., Wang, Bao
We propose heavy ball neural ordinary differential equations (HBNODEs), leveraging the continuous limit of the classical momentum accelerated gradient descent, to improve neural ODEs (NODEs) training and inference. HBNODEs have two properties that imply practical advantages over NODEs: (i) The adjoint state of an HBNODE also satisfies an HBNODE, accelerating both forward and backward ODE solvers, thus significantly reducing the number of function evaluations (NFEs) and improving the utility of the trained models. (ii) The spectrum of HBNODEs is well structured, enabling effective learning of long-term dependencies from complex sequential data. We verify the advantages of HBNODEs over NODEs on benchmark tasks, including image classification, learning complex dynamics, and sequential modeling. Our method requires remarkably fewer forward and backward NFEs, is more accurate, and learns long-term dependencies more effectively than the other ODE-based neural network models. Code is available at \url{https://github.com/hedixia/HeavyBallNODE}.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- Asia > Middle East > Jordan (0.04)
- (6 more...)