Evaluating Intelligence via Trial and Error