Rethinking–or Remembering–Generalization in Neural Networks