Structured Sparsification of Gated Recurrent Neural Networks

Lobacheva, Ekaterina, Chirkova, Nadezhda, Markovich, Alexander, Vetrov, Dmitry

Nov-13-2019–arXiv.org Machine Learning

Recently, a lot of techniques were developed to sparsify the weights of neural networks and to remove networks' structure units, e.g. neurons. We adjust the existing sparsification approaches to the gated recurrent architectures. Specifically, in addition to the sparsification of weights and neurons, we propose sparsifying the preactivations of gates. This makes some gates constant and simplifies LSTM structure. We test our approach on the text classification and language modeling tasks. We observe that the resulting structure of gate sparsity depends on the task and connect the learned structure to the specifics of the particular tasks. Our method also improves neuron-wise compression of the model in most of the tasks.

gate structure, neuron, sparsification, (15 more...)

arXiv.org Machine Learning

Nov-13-2019

arXiv.org PDF

Add feedback

Country:
- North America > Canada (0.04)
- Europe > Russia
  - Central Federal District > Moscow Oblast > Moscow (0.04)
- Asia
  - Russia (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found