Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules
Irie, Kazuki, Faccio, Francesco, Schmidhuber, Jürgen
–arXiv.org Artificial Intelligence
Neural ordinary differential equations (ODEs) have attracted much attention as continuous-time counterparts of deep residual neural networks (NNs), and numerous extensions for recurrent NNs have been proposed. Since the 1980s, ODEs have also been used to derive theoretical results for NN learning rules, e.g., the famous connection between Oja's rule and principal component analysis. Such rules are typically expressed as additive iterative update processes which have straightforward ODE counterparts. Here we introduce a novel combination of learning rules and Neural ODEs to build continuous-time sequence processing nets that learn to manipulate short-term memory in rapidly changing synaptic connections of other nets. This yields continuous-time counterparts of Fast Weight Programmers and linear Transformers. Our novel models outperform the best existing Neural Controlled Differential Equation based models on various time series classification tasks, while also addressing their fundamental scalability limitations.
arXiv.org Artificial Intelligence
Oct-14-2022
- Country:
- North America
- United States
- Maryland > Baltimore (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Denver County
- Denver (0.04)
- California > Los Angeles County
- Los Angeles (0.14)
- Long Beach (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Switzerland (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Portugal > Porto
- Porto (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Germany > North Rhine-Westphalia
- Upper Bavaria > Munich (0.04)
- France > Hauts-de-France
- Austria > Styria
- Graz (0.04)
- Asia
- Singapore (0.04)
- Middle East > Saudi Arabia
- Mecca Province > Thuwal (0.04)
- China > Beijing
- Beijing (0.04)
- North America
- Genre:
- Research Report > Promising Solution (0.34)
- Industry:
- Health & Medicine (0.94)
- Education > Educational Setting
- Continuing Education (0.40)
- Technology: