Variational Dropout Sparsifies Deep Neural Networks

Molchanov, Dmitry, Ashukha, Arsenii, Vetrov, Dmitry

Jun-13-2017–arXiv.org Machine Learning

We explore a recently proposed Variational Dropout technique that provided an elegant Bayesian interpretation to Gaussian Dropout. We extend Variational Dropout to the case when dropout rates are unbounded, propose a way to reduce the variance of the gradient estimator and report first experimental results with individual dropout rates per weight. Interestingly, it leads to extremely sparse solutions both in fully-connected and convolutional layers. This effect is similar to automatic relevance determination effect in empirical Bayes but has a number of advantages. We reduce the number of parameters up to 280 times on LeNet architectures and up to 68 times on VGG-like networks with a negligible decrease of accuracy.

artificial intelligence, dropout, machine learning, (16 more...)

arXiv.org Machine Learning

Jun-13-2017

arXiv.org PDF

Add feedback

Country:
- Europe > Russia (0.29)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (1.00)
  - Learning Graphical Models > Directed Networks
    - Bayesian Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found