One Simple Trick to Fix Your Bayesian Neural Network
Piotr Tempczyk, Ksawery Smoczyński, Philip Smolenski-Jensen, Marek Cygan
– arXiv.org Artificial Intelligence
One of the most popular estimation methods in Bayesian neural networks (BNNs) is mean-field variational inference (MFVI). In this work, we show that neural networks with the ReLU activation function induce posteriors that are hard to fit with MFVI. We provide a theoretical justification for this phenomenon, study it empirically, and report the results of a series of experiments investigating the effect of the activation function on the calibration of BNNs. We find that using Leaky ReLU activations leads to more Gaussian-like weight posteriors and achieves a lower expected calibration error (ECE) than the ReLU-based counterpart.
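The setup the abstract describes can be illustrated with a short sketch. Below is a minimal, hypothetical PyTorch implementation, not the authors' code, of a mean-field variational linear layer with the usual Gaussian reparameterization trick, the ReLU-to-Leaky-ReLU swap the paper studies, and a standard binned ECE estimator. The names `MeanFieldLinear`, `make_bnn`, and `expected_calibration_error` are illustrative, and the KL regularizer of the ELBO is omitted for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MeanFieldLinear(nn.Module):
    """Linear layer with a factorized (mean-field) Gaussian posterior over weights."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.mu = nn.Parameter(torch.zeros(out_features, in_features))
        # rho parameterizes the std via softplus so sigma stays positive
        self.rho = nn.Parameter(torch.full((out_features, in_features), -5.0))
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x):
        sigma = F.softplus(self.rho)
        # Reparameterization trick: w = mu + sigma * eps, eps ~ N(0, I)
        eps = torch.randn_like(sigma)
        w = self.mu + sigma * eps
        return F.linear(x, w, self.bias)

def make_bnn(in_dim, hidden, out_dim, negative_slope=0.0):
    """Two-layer BNN; negative_slope=0.0 gives ReLU, >0 gives Leaky ReLU."""
    act = nn.LeakyReLU(negative_slope) if negative_slope > 0 else nn.ReLU()
    return nn.Sequential(
        MeanFieldLinear(in_dim, hidden), act,
        MeanFieldLinear(hidden, out_dim),
    )

def expected_calibration_error(probs, labels, n_bins=15):
    """Binned ECE: |accuracy - confidence| per bin, weighted by bin mass."""
    conf, pred = probs.max(dim=1)
    correct = pred.eq(labels).float()
    bins = torch.linspace(0, 1, n_bins + 1)
    ece = torch.zeros(1)
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (conf > lo) & (conf <= hi)
        if mask.any():
            ece += mask.float().mean() * (correct[mask].mean() - conf[mask].mean()).abs()
    return ece.item()
```

Since MFVI constrains the approximate posterior to a factorized Gaussian, the swap above changes only the activation; the paper's finding is that the Leaky ReLU network's true weight posterior is closer to this Gaussian family, which a binned ECE estimator like the one sketched can be used to probe.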
Jul-26-2022