A Additional Background on Bayesian neural networks and variational inference

Consider a training set comprising N input-output pairs, D = { x