Assessing Generalization of SGD via Disagreement