Scaling and Generalization in Neural Networks: A Case Study