ARACNE: An LLM-Based Autonomous Shell Pentesting Agent

Nieponice, Tomas, Valeros, Veronica, Garcia, Sebastian

Feb-24-2025–arXiv.org Artificial Intelligence

The complete automation of cyber-attacks is an area of growing interest since the surge of Large Language Models (LLMs) in recent years. Although the application of LLM in all areas of cybersecurity has flourished, the creation of attacking LLM agents that can act independently is among the most popular options [1]. Attacking LLM agents can perform automatic security testing of applications, lowering the cost for organizations to find vulnerabilities and misconfiguration problems and identify other security issues [2]. Existing automated attacking agents, such as PenHeal [2], AutoAttacker [3], and HackSynth [4] show promising results but with clear limitations. Agents are unable to work so far without occasional mistakes and hallucinations.

agent, aracne, module, (14 more...)

arXiv.org Artificial Intelligence

Feb-24-2025

arXiv.org PDF

Add feedback

Country:
- South America > Argentina
  - Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- North America > United States
  - Utah > Salt Lake County > Salt Lake City (0.04)
- Europe > Czechia
  - Prague (0.05)

Genre:
- Research Report (0.83)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government > Military
  - Cyberwarfare (0.69)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found