AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World