Verifying Generalization in Deep Learning