Constructing Deep Neural Networks by Bayesian Network Structure Learning
Rohekar, Raanan Y., Nisimov, Shami, Gurwicz, Yaniv, Koren, Guy, Novik, Gal
Neural Information Processing Systems
We introduce a principled approach for unsupervised structure learning of deep neural networks. We propose a new interpretation for depth and inter-layer connectivity where conditional independencies in the input distribution are encoded hierarchically in the network structure. Thus, the depth of the network is determined inherently. The proposed method casts the problem of neural network structure learning as a problem of Bayesian network structure learning. Instead of directly learning the discriminative structure, it first learns a generative graph, then constructs its stochastic inverse, and finally constructs a discriminative graph.
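The abstract does not say how conditional independencies in the input distribution are detected. As a rough illustration only, the sketch below assumes a partial-correlation test on toy Gaussian data, one common choice in constraint-based Bayesian network structure learning. The function names, the threshold, and the data are hypothetical and are not taken from the paper.

```python
import math
import random


def pearson(a, b):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def partial_corr(x, y, z):
    """Partial correlation of x and y given a single variable z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Toy data (an assumption for illustration): a hidden cause h drives
# both observed inputs x1 and x2, each with its own independent noise.
rng = random.Random(0)
n = 20000
h = [rng.gauss(0, 1) for _ in range(n)]
x1 = [v + 0.5 * rng.gauss(0, 1) for v in h]
x2 = [v + 0.5 * rng.gauss(0, 1) for v in h]

THRESHOLD = 0.05  # hypothetical cut-off, not a value from the paper

# x1 and x2 are strongly correlated marginally ...
marginal_independent = abs(pearson(x1, x2)) < THRESHOLD
# ... but become (approximately) uncorrelated once h is conditioned on.
conditional_independent = abs(partial_corr(x1, x2, h)) < THRESHOLD
```

In this toy setting the two inputs are dependent on their own and independent given `h`. That is the kind of relation a structure-learning procedure could encode by placing a shared parent node above `x1` and `x2`.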