Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural Networks
Letarte, Gaël, Germain, Pascal, Guedj, Benjamin, Laviolette, François
We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, overcoming the fact that binary activation function is non-differentiable; (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Noteworthy, our results are obtained by minimizing the expected loss of an architecture-dependent aggregation of binary activated deep neural networks. The performance of our approach is assessed on a thorough numerical experiment protocol on real-life datasets.
May-29-2019
- Country:
- North America > Canada (0.04)
- Europe
- France (0.04)
- United Kingdom (0.04)
- Genre:
- Research Report > New Finding (0.48)
- Technology: