Training neural network ensembles via trajectory sampling
Mair, Jamie F., Rose, Dominic C., Garrahan, Juan P.
–arXiv.org Artificial Intelligence
In machine learning, there is renewed interest in neural network ensembles (NNEs), whereby predictions are obtained as an aggregate from a diverse set of smaller models, rather than from a single larger model. Here, we show how to define and train a NNE using techniques from the study of rare trajectories in stochastic systems. We define an NNE in terms of the trajectory of the model parameters under a simple, and discrete in time, diffusive dynamics, and train the NNE by biasing these trajectories towards a small time-integrated loss, as controlled by appropriate counting fields which act as hyperparameters. We demonstrate the viability of this technique on a range of simple supervised learning tasks. We discuss potential advantages of our trajectory sampling approach compared with more conventional gradient based methods.
arXiv.org Artificial Intelligence
May-10-2023
- Country:
- Europe > United Kingdom
- England > Nottinghamshire > Nottingham (0.04)
- North America > United States
- Rhode Island > Providence County > Providence (0.04)
- Europe > United Kingdom
- Genre:
- Research Report (0.50)
- Technology: