Fixed-Point RNNs: Interpolating from Diagonal to Dense
Neural Information Processing Systems
Linear recurrent neural networks (RNNs) and state-space models (SSMs) such as Mamba have become promising alternatives to softmax attention as sequence-mixing layers in Transformer architectures. Current models, however, do not exhibit the full state-tracking expressivity of RNNs because they rely on channel-wise (i.e.
Jun-16-2026, 18:07:09 GMT
- Country:
- North America > United States (0.28)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Technology: