On the Practical Computational Power of Finite Precision RNNs for Language Recognition

Weiss, Gail, Goldberg, Yoav, Yahav, Eran

May-13-2018–arXiv.org Machine Learning

While Recurrent Neural Networks (RNNs) are famously known to be Turing complete, this relies on infinite precision in the states and unbounded computation time. We consider the case of RNNs with finite precision whose computation time is linear in the input length. Under these limitations, we show that different RNN variants have different computational power. In particular, we show that the LSTM and the Elman-RNN with ReLU activation are strictly stronger than the RNN with a squashing activation and the GRU. This is achieved because LSTMs and ReLU-RNNs can easily implement counting behavior. We show empirically that the LSTM does indeed learn to effectively use the counting mechanism.

deep learning, dimension, neural network, (18 more...)

arXiv.org Machine Learning

May-13-2018

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - Qatar (0.14)
- North America > United States
  - Texas (0.14)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found