Bayesian Learning of Neural Network Architectures

Dikov, Georgi, van der Smagt, Patrick, Bayer, Justin

Jan-27-2019–arXiv.org Machine Learning

In this paper we propose a Bayesian method for estimating architectural parameters of neural networks, namely layer size and network depth. We do this by learning concrete distributions over these parameters. Our results show that regular networks with a learnt structure can generalise better on small datasets, while fully stochastic networks can be more robust to parameter initialisation. The proposed method relies on standard neural variational learning and, unlike randomised architecture search, does not require a retraining of the model, thus keeping the computational overhead at minimum.

architecture, layer size, neural network, (13 more...)

arXiv.org Machine Learning

Jan-27-2019

arXiv.org PDF

Add feedback

Country:
- North America > Canada
  - Ontario > Toronto (0.04)
- Europe > Germany
  - Bavaria > Upper Bavaria > Munich (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Japan > Kyūshū & Okinawa
    - Okinawa (0.04)

Genre:
- Research Report > New Finding (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found