Hacking CTFs with Plain Agents
Turtayev, Rustem, Petrov, Artem, Volkov, Dmitrii, Volk, Denis
–arXiv.org Artificial Intelligence
Cybersecurity is one of the key AI risk areas (OpenAI 2024b; The White House 2023; UK Government 2023): advanced LLMs could hack real-world systems at speeds far exceeding human capabilities (OpenAI 2024a). To quantify AI cyber capabilities, researchers use benchmarks, with InterCode-CTF (Yang, Prabhakar, Narasimhan, et al. 2023) among the most popular. InterCode-CTF adapts traditional Capture The Flag competitions to assess LLM hacking skills. Previously, Phuong et al. 2024 showed low performance on this benchmark and suggested low cyber exploitation capabilities. A recent follow-up by Abramovich et al. 2024 claimed state-ofthe-art results (72%) due to a particular novel harness design choice.
arXiv.org Artificial Intelligence
Dec-3-2024
- Country:
- Europe (0.35)
- North America > United States (0.49)
- Genre:
- Research Report (1.00)
- Industry:
- Technology: