Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures