Full-Capacity Unitary Recurrent Neural Networks

Scott Wisdom, Thomas Powers, John Hershey, Jonathan Le Roux, Les Atlas

Feb-14-2020, 16:56:23 GMT–Neural Information Processing Systems

Recurrent neural networks are powerful models for processing sequential data, but they are generally plagued by vanishing and exploding gradient problems. Unitary recurrent neural networks (uRNNs), which use unitary recurrence matrices, have recently been proposed as a means to avoid these issues. However, in previous experiments, the recurrence matrices were restricted to be a product of parameterized unitary matrices, and an open question remains: when does such a parameterization fail to represent all unitary matrices, and how does this restricted representational capacity limit what can be learned? To address this question, we propose full-capacity uRNNs that optimize their recurrence matrix over all unitary matrices, leading to significantly improved performance over uRNNs that use a restricted-capacity recurrence matrix. Our contribution consists of two main components.
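To make "optimizing the recurrence matrix over all unitary matrices" concrete, here is a minimal sketch, not taken from the abstract, which does not say which optimizer is used. It assumes a Cayley-transform update, a standard way to take a gradient step while staying exactly on the set of unitary matrices. The helper names `random_unitary` and `cayley_step`, the toy loss, the target matrix `T`, and the step size are all hypothetical choices made for illustration.

```python
import numpy as np


def random_unitary(n, rng):
    # The Q factor of a complex Gaussian matrix is unitary.
    z = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    q, _ = np.linalg.qr(z)
    return q


def cayley_step(W, G, lr):
    """One multiplicative update that keeps W unitary (assumed scheme).

    W  : current unitary matrix (n x n, complex)
    G  : gradient of the loss with respect to W (n x n, complex)
    lr : real step size
    """
    # A is skew-Hermitian (A^H = -A), so its Cayley transform is unitary.
    A = G @ W.conj().T - W @ G.conj().T
    I = np.eye(W.shape[0], dtype=W.dtype)
    # Solve (I + lr/2 A) W_new = (I - lr/2 A) W instead of forming an inverse.
    return np.linalg.solve(I + (lr / 2) * A, (I - (lr / 2) * A) @ W)


def toy_loss(W, T):
    # Hypothetical objective: squared Frobenius distance to a target T.
    return 0.5 * np.linalg.norm(W - T) ** 2


rng = np.random.default_rng(0)
n = 8
W = random_unitary(n, rng)
T = random_unitary(n, rng)  # hypothetical unitary target

loss_before = toy_loss(W, T)
for _ in range(50):
    G = W - T  # gradient of the toy loss with respect to W
    W = cayley_step(W, G, lr=0.1)
loss_after = toy_loss(W, T)

# Distance of W^H W from the identity; stays at round-off level.
unitarity_error = np.linalg.norm(W.conj().T @ W - np.eye(n))
print(loss_before, loss_after, unitarity_error)
```

Because every step multiplies `W` by a unitary matrix, no re-projection or re-normalization is needed, and the full matrix is free to move anywhere on the unitary group rather than within a fixed product of structured factors.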

full-capacity unitary recurrent neural network, matrix, recurrence matrix

