Review for NeurIPS paper: In search of robust measures of generalization