An Investigation of how Label Smoothing Affects Generalization