Fixing Asymptotic Uncertainty of Bayesian Neural Networks with Infinite ReLU Features

Kristiadi, Agustinus, Hein, Matthias, Hennig, Philipp

Oct-6-2020–arXiv.org Machine Learning

Approximate Bayesian methods can mitigate overconfidence in ReLU networks. However, far away from the training data, even Bayesian neural networks (BNNs) can still underestimate uncertainty and thus be overconfident. We suggest to fix this by considering an infinite number of ReLU features over the input domain that are never part of the training process and thus remain at prior values. Perhaps surprisingly, we show that this model leads to a tractable Gaussian process (GP) term that can be added to a pre-trained BNN's posterior at test time with negligible cost overhead. The BNN then yields structured uncertainty in the proximity of training data, while the GP prior calibrates uncertainty far away from them. As a key contribution, we prove that the added uncertainty yields cubic predictive variance growth, and thus the ideal uniform (maximum entropy) confidence in multi-class classification far from the training data. Calibrated uncertainty is crucial for safety-critical decision making by neural networks (NNs) (Amodei et al., 2016). Standard training methods of NNs yield point estimates that, even if they are highly accurate, can still be severely overconfident (Guo et al., 2017).

bayesian inference, neural network, rgpr, (20 more...)

arXiv.org Machine Learning

Oct-6-2020

arXiv.org PDF

Add feedback

Country:
- Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.66)
    - Neural Networks (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found