Towards Scalable and Stable Parallelization of Nonlinear RNNs

May-26-2025, 15:51:40 GMT–Neural Information Processing Systems

Transformers and linear state space models can be evaluated in parallel on modern hardware, but evaluating nonlinear RNNs appears to be an inherently sequential problem. Recently, however, Lim et al. '24 developed an approach called DEER, which evaluates nonlinear RNNs in parallel by posing the states as the solution to a fixed-point problem. They derived a parallel form of Newton's method to solve the fixed-point problem and achieved significant speedups over sequential evaluation. However, the computational complexity of DEER is cubic in the state size, and the algorithm can suffer from numerical instability. We address these limitations with two novel contributions.
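To make the fixed-point idea concrete, here is a minimal sketch in plain Python. It is not the implementation of Lim et al. or of this paper. It assumes a hypothetical scalar RNN `h_t = tanh(w * h_{t-1} + u * x_t)`, and the names `rnn_step`, `rnn_step_jac`, `sequential_eval`, and `newton_eval`, along with the values of `w` and `u`, are illustrative only. Each Newton iteration linearizes the RNN around the current guess for all time steps at once, which leaves a linear recurrence `h_t = j_t * h_{t-1} + b_t`. The sketch solves that recurrence with an ordinary loop for readability; a parallel implementation would evaluate it with an associative scan.

```python
import math


def rnn_step(h_prev, x, w=0.9, u=0.5):
    """One step of a toy scalar nonlinear RNN (w and u are assumed values)."""
    return math.tanh(w * h_prev + u * x)


def rnn_step_jac(h_prev, x, w=0.9, u=0.5):
    """Derivative of rnn_step with respect to h_prev."""
    return w * (1.0 - math.tanh(w * h_prev + u * x) ** 2)


def sequential_eval(h0, xs):
    """Reference evaluation: one step after another."""
    hs, h = [], h0
    for x in xs:
        h = rnn_step(h, x)
        hs.append(h)
    return hs


def newton_eval(h0, xs, num_iters=50, tol=1e-12):
    """Solve h_t = rnn_step(h_{t-1}, x_t) for all t with Newton's method."""
    hs = [0.0] * len(xs)  # initial guess for every state
    for _ in range(num_iters):
        prev = [h0] + hs[:-1]  # current guess for h_{t-1} at each t
        # These evaluations are independent across t, so they can run in parallel.
        fs = [rnn_step(p, x) for p, x in zip(prev, xs)]
        js = [rnn_step_jac(p, x) for p, x in zip(prev, xs)]
        bs = [f - j * p for f, j, p in zip(fs, js, prev)]
        # Linearized update: h_t = j_t * h_{t-1} + b_t.
        # Written as a loop here; an associative scan would parallelize it.
        new_hs, h = [], h0
        for j, b in zip(js, bs):
            h = j * h + b
            new_hs.append(h)
        err = max(abs(a - b) for a, b in zip(new_hs, hs))
        hs = new_hs
        if err < tol:
            break
    return hs


xs = [math.sin(0.3 * t) for t in range(20)]
seq_states = sequential_eval(0.0, xs)
newton_states = newton_eval(0.0, xs)
max_gap = max(abs(a - b) for a, b in zip(seq_states, newton_states))
```

In this scalar toy the Jacobians `j_t` are plain numbers. With a state of size D they become D×D matrices, and a scan that composes dense matrices has to multiply them, which is one way to see where a cost cubic in the state size can arise.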

artificial intelligence, nonlinear rnn, scalable and stable parallelization
