Why you don't overfit, and don't need Bayes if you only train for one epoch

Open in new window