Bayesian generative models can flag performance loss, bias, and out-of-distribution image content