AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses

Open in new window