Bayesian Layers: A Module for Neural Network Uncertainty
Dustin Tran, Michael W. Dusenberry, Mark van der Wilk, Danijar Hafner
We describe Bayesian Layers, a module designed for fast experimentation with neural network uncertainty. It extends neural network libraries with layers capturing uncertainty over weights (Bayesian neural nets), pre-activation units (dropout), activations ("stochastic output layers"), and the function itself (Gaussian processes). With reversible layers, one can also propagate uncertainty from input to output such as for flow-based distributions and constant-memory backpropagation. Bayesian Layers are a drop-in replacement for other layers, maintaining core features that one typically desires for experimentation. As demonstration, we fit a 10-billion parameter "Bayesian Transformer" on 512 TPUv2 cores, which replaces attention layers with their Bayesian counterpart.
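To illustrate the kind of weight-uncertainty layer the abstract describes, below is a minimal NumPy sketch of a dense layer with a mean-field Gaussian posterior over its weights, sampled via the reparameterization trick. The `BayesianDense` class and all of its names are hypothetical for illustration; this is not the Bayesian Layers API itself.

```python
import numpy as np

rng = np.random.default_rng(0)


class BayesianDense:
    """Illustrative dense layer with a mean-field Gaussian posterior
    over its weights (hypothetical sketch, not the Bayesian Layers API)."""

    def __init__(self, in_dim, out_dim):
        # Variational parameters: a mean and a pre-softplus scale per weight.
        self.w_mean = rng.normal(0.0, 0.1, size=(in_dim, out_dim))
        self.w_rho = np.full((in_dim, out_dim), -3.0)

    def __call__(self, x):
        # Reparameterization: w = mu + softplus(rho) * eps, eps ~ N(0, I),
        # so each forward pass uses a fresh weight sample.
        std = np.log1p(np.exp(self.w_rho))
        eps = rng.normal(size=self.w_mean.shape)
        w = self.w_mean + std * eps
        return x @ w

    def kl_to_standard_normal(self):
        # Closed-form KL(q(w) || N(0, I)) summed over weights; added to the
        # training loss, this gives the usual variational objective.
        var = np.log1p(np.exp(self.w_rho)) ** 2
        return 0.5 * np.sum(var + self.w_mean ** 2 - 1.0 - np.log(var))


layer = BayesianDense(4, 2)
x = np.ones((3, 4))
y1, y2 = layer(x), layer(x)  # two stochastic forward passes give different outputs
```

Because the layer only changes what happens inside `__call__`, it can stand in for a deterministic dense layer in an existing model, which is the drop-in property the abstract emphasizes.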
Dec-11-2018