Modern Neural Networks Generalize on Small Data Sets