

Full-Capacity Unitary Recurrent Neural Networks

Neural Information Processing Systems

Recurrent neural networks are powerful models for processing sequential data, but they are generally plagued by vanishing and exploding gradient problems. Unitary recurrent neural networks (uRNNs), which use unitary recurrence matrices, have recently been proposed as a means to avoid these issues. However, in previous experiments, the recurrence matrices were restricted to be a product of parameterized unitary matrices, and an open question remains: when does such a parameterization fail to represent all unitary matrices, and how does this restricted representational capacity limit what can be learned? To address this question, we propose full-capacity uRNNs that optimize their recurrence matrix over all unitary matrices, leading to significantly improved performance over uRNNs that use a restricted-capacity recurrence matrix. Our contribution consists of two main components. First, we provide a theoretical argument to determine if a unitary parameterization has restricted capacity. Using this argument, we show that a recently proposed unitary parameterization has restricted capacity for hidden state dimension greater than 7. Second, we show how a complete, full-capacity unitary recurrence matrix can be optimized over the differentiable manifold of unitary matrices. The resulting multiplicative gradient step is very simple and does not require gradient clipping or learning rate adaptation. We confirm the utility of our claims by empirically evaluating our new full-capacity uRNNs on both synthetic and natural data, achieving superior performance compared to both LSTMs and the original restricted-capacity uRNNs.
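The multiplicative gradient step mentioned in the abstract can be sketched with a Cayley-transform update on the unitary manifold (the Wen–Yin style update; this is an illustrative NumPy sketch, not the authors' reference implementation, and `unitary_gradient_step` is a name chosen here for illustration):

```python
import numpy as np

def unitary_gradient_step(W, G, lr=0.05):
    """One multiplicative gradient step that keeps W exactly unitary.

    W : current unitary recurrence matrix (n x n, complex)
    G : Euclidean gradient dL/dW (n x n, complex)

    A = G W^H - W G^H is skew-Hermitian, so the Cayley transform
    (I + lr/2 A)^{-1} (I - lr/2 A) is unitary, and multiplying a
    unitary W by it yields another unitary matrix -- no clipping
    or re-orthogonalization needed.
    """
    n = W.shape[0]
    A = G @ W.conj().T - W @ G.conj().T  # skew-Hermitian: A^H = -A
    I = np.eye(n)
    return np.linalg.solve(I + (lr / 2) * A, (I - (lr / 2) * A) @ W)

# Start from a random unitary matrix (QR of a complex Gaussian).
rng = np.random.default_rng(0)
Z = rng.standard_normal((8, 8)) + 1j * rng.standard_normal((8, 8))
W, _ = np.linalg.qr(Z)
G = rng.standard_normal((8, 8)) + 1j * rng.standard_normal((8, 8))

W_new = unitary_gradient_step(W, G)
# Unitarity is preserved to machine precision.
print(np.allclose(W_new.conj().T @ W_new, np.eye(8)))  # True
```

Because the update is multiplicative rather than additive, the iterate never leaves the unitary manifold, which is why no projection or gradient clipping is required.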




Input-Output Equivalence of Unitary and Contractive RNNs

Neural Information Processing Systems

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states. The theoretical results are supported by experiments on modeling of slowly-varying dynamical systems.
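The flavor of the state-doubling construction can be illustrated, in the linear real-valued case, by the classical unitary dilation of a contraction: any matrix with spectral norm at most 1 sits as the top-left block of an orthogonal matrix of twice the size. This NumPy sketch shows that dilation only; it is an analogy for intuition, not the paper's exact ReLU construction:

```python
import numpy as np

def psd_sqrt(M):
    """Square root of a symmetric PSD matrix via eigendecomposition."""
    w, V = np.linalg.eigh(M)
    w = np.clip(w, 0.0, None)  # guard against tiny negative eigenvalues
    return V @ np.diag(np.sqrt(w)) @ V.T

def unitary_dilation(A):
    """Embed a contraction A (spectral norm <= 1) into an orthogonal
    matrix of twice the size:

        U = [[A,   Dr],
             [Dc, -A.T]],  Dr = (I - A A^T)^{1/2}, Dc = (I - A^T A)^{1/2}.
    """
    n = A.shape[0]
    I = np.eye(n)
    Dr = psd_sqrt(I - A @ A.T)
    Dc = psd_sqrt(I - A.T @ A)
    return np.block([[A, Dr], [Dc, -A.T]])

# Make a random contraction by rescaling to spectral norm 0.9.
rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
A *= 0.9 / np.linalg.norm(A, 2)

U = unitary_dilation(A)
print(np.allclose(U.T @ U, np.eye(8)))  # True: U is orthogonal
print(np.allclose(U[:4, :4], A))        # True: A is the top-left block
```

The off-diagonal blocks cancel in U^T U because (I - A A^T)^{1/2} A = A (I - A^T A)^{1/2}, which follows from the SVD of A.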




Full-Capacity Unitary Recurrent Neural Networks

Scott Wisdom, Thomas Powers, John Hershey, Jonathan Le Roux, Les Atlas

Neural Information Processing Systems

Recurrent neural networks are powerful models for processing sequential data, but they are generally plagued by vanishing and exploding gradient problems. Unitary recurrent neural networks (uRNNs), which use unitary recurrence matrices, have recently been proposed as a means to avoid these issues.




Reviews: Input-Output Equivalence of Unitary and Contractive RNNs

Neural Information Processing Systems

UPDATE: I'm largely happy with how the authors addressed my points. I still think that the requirement for the RNN to be non-expansive is quite restrictive per se, but this work may still be a good starting point for further theoretical discussion of such issues. The authors provide a straightforward proof by construction that a URNN with twice the number of hidden states as the corresponding RNN is as expressive as that RNN, i.e., it can be formulated such that it produces the same outputs for the same series of inputs. While this is true for RNNs with ReLU activations, the authors further prove, by linearizing around fixed points, that this is generally not true for RNNs/URNNs with sigmoid activations.

Strengths:
- Given that URNNs are an important technique for modeling long-term dependencies while avoiding some of the complexities of LSTMs/GRUs, rigorous theoretical results on the restrictiveness of the unitary constraint are timely and important.


Reviews: Full-Capacity Unitary Recurrent Neural Networks

Neural Information Processing Systems

I think this is a strong paper in that it presents multiple theoretical and empirical contributions. The theoretical ideas and proposed optimization algorithm are in my eyes more impressive than the empirical work, which is also decent but could benefit from a more thorough analysis. In any case, I think it's a very nice continuation of the ideas presented in the original paper about uRNNs. Except for some minor issues, the paper is well written, in the sense that it was easy enough for me to follow the material about Givens operators and how they can be used to quantify the representational capacity of a unitary matrix, even though this specific subject matter was rather new to me. The proofs in the supplementary material seem sound to me and are relatively simple.