on both our theoretical contributions showing an equivalence between a notion of training speed and the Bayesian