In Search of Robust Measures of Generalization