Practical Variational Inference for Neural Networks
–Neural Information Processing Systems
Variational methods have been previously explored as a tractable approximation to Bayesian inference for neural networks. However the approaches proposed so far have only been applicable to a few simple network architectures. This paper introduces an easy-to-implement stochastic variational method (or equivalently, minimum description length loss function) that can be applied to most neural networks. Along the way it revisits several common regularisers from a variational perspective. It also provides a simple pruning heuristic that can both drastically reduce the number of network weights and lead to improved generalisation. Experimental results are provided for a hierarchical multidimensional recurrent neural network applied to the TIMIT speech corpus.
Neural Information Processing Systems
Mar-15-2024, 04:58:24 GMT
- Country:
- North America
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.05)
- California > San Mateo County
- San Mateo (0.04)
- Pennsylvania > Allegheny County
- Canada > Ontario
- Toronto (0.14)
- United States
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America