Hacking CTFs with Plain Agents

Turtayev, Rustem, Petrov, Artem, Volkov, Dmitrii, Volk, Denis

Dec-3-2024–arXiv.org Artificial Intelligence

Cybersecurity is one of the key AI risk areas (OpenAI 2024b; The White House 2023; UK Government 2023): advanced LLMs could hack real-world systems at speeds far exceeding human capabilities (OpenAI 2024a). To quantify AI cyber capabilities, researchers use benchmarks, with InterCode-CTF (Yang, Prabhakar, Narasimhan, et al. 2023) among the most popular. InterCode-CTF adapts traditional Capture The Flag competitions to assess LLM hacking skills. Previously, Phuong et al. 2024 showed low performance on this benchmark and suggested low cyber exploitation capabilities. A recent follow-up by Abramovich et al. 2024 claimed state-ofthe-art results (72%) due to a particular novel harness design choice.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Dec-3-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.49)
- Europe (0.35)

Genre:
- Research Report (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government
  - Military > Cyberwarfare (0.36)
  - Regional Government
    - North America Government > United States Government (0.49)
    - Europe Government (0.35)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.46)