Joint Inference for Neural Network Depth and Dropout Regularization

Jan-19-2025, 10:25:45 GMT–Neural Information Processing Systems

Dropout regularization methods prune a neural network's pre-determined backbone structure to avoid overfitting. However, a deep model still tends to be poorly calibrated, with high confidence on incorrect predictions. We propose a unified Bayesian model selection method that jointly infers the most plausible network depth warranted by the data and performs dropout regularization at the same time. In particular, to infer network depth we define a beta process over the number of hidden layers, which allows the depth to go to infinity. Layer-wise activation probabilities induced by the beta process modulate neuron activation via binary vectors drawn from a conjugate Bernoulli process.
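
The interplay between the beta process and the Bernoulli process described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a truncated stick-breaking construction, in which each layer's activation probability is a running product of Beta-distributed sticks, and it assumes that "effective depth" means the deepest layer with at least one active neuron. The function names, the hyperparameters `alpha` and `beta`, the truncation level, and the layer width are all hypothetical choices made for illustration.

```python
import numpy as np


def sample_layer_masks(alpha=2.0, beta=1.0, truncation=10, width=8, seed=0):
    """Draw layer-wise activation probabilities and binary neuron masks.

    Assumption: a stick-breaking approximation to the beta process,
    truncated at `truncation` layers (the abstract lets depth be unbounded).
    """
    rng = np.random.default_rng(seed)
    # One stick per layer, each in (0, 1).
    nu = rng.beta(alpha, beta, size=truncation)
    # Running products give non-increasing activation probabilities,
    # so deeper layers are less likely to be switched on.
    pi = np.cumprod(nu)
    # Bernoulli draws: one binary mask entry per neuron in each layer.
    masks = (rng.random((truncation, width)) < pi[:, None]).astype(np.int8)
    return pi, masks


def effective_depth(masks):
    """Deepest layer (1-indexed) with at least one active neuron, else 0.

    Assumption: this definition of depth is chosen for illustration only.
    """
    active = masks.any(axis=1)
    if not active.any():
        return 0
    return int(np.nonzero(active)[0].max()) + 1


if __name__ == "__main__":
    pi, masks = sample_layer_masks()
    print("activation probabilities:", np.round(pi, 3))
    print("effective depth:", effective_depth(masks))
```

In this sketch the masks play the role of dropout (switching individual neurons off), while the decay of the layer-wise probabilities is what limits depth: once a layer's probability is small, its neurons are rarely active.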

joint inference, network depth and dropout regularization, neural network depth
