Optimizing Neural Networks via Koopman Operator Theory

Oct-9-2024, 14:53:04 GMT–Neural Information Processing Systems

Koopman operator theory, a powerful framework for discovering the underlying dynamics of nonlinear dynamical systems, was recently shown to be intimately connected with neural network training. In this work, we take the first steps in making use of this connection. As Koopman operator theory is a linear theory, a successful implementation of it in evolving network weights and biases offers the promise of accelerated training, especially in the context of deep networks, where optimization is inherently a non-convex problem. We show that Koopman operator theoretic methods allow for accurate predictions of weights and biases of feedforward, fully connected deep networks over a non-trivial range of training time. During this window, we find that our approach is 10x faster than various gradient descent based methods (e.g.

deep network, koopman operator theory, optimizing neural network

Neural Information Processing Systems

Oct-9-2024, 14:53:04 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)