On the Effectiveness of Mode Exploration in Bayesian Model Averaging for Neural Networks