Towards Production-Worthy Simulation for Autonomous Cyber Operations

Tholl, Konur; Mezouar, Mariam El; Mallah, Ranwa Al

Aug-28-2025–arXiv.org Artificial Intelligence

Simulated environments have proven invaluable in Autonomous Cyber Operations (ACO), where Reinforcement Learning (RL) agents can be trained without the computational overhead of emulation. These environments must accurately represent cybersecurity scenarios while producing the signals needed to support RL training. In this study, we present a framework in which we first extend CybORG's Cage Challenge 2 environment by implementing three new actions (Patch, Isolate, and Unisolate) to better represent the capabilities available to human operators in real-world settings. We then propose a design for agent development in which we modify the reward signals and the agent's feature space to enhance training performance. To validate these modifications, we train DQN and PPO agents in the updated environment. Our study demonstrates that CybORG can be extended with additional realistic functionality while maintaining its ability to generate informative training signals for RL agents.
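As a rough illustration of what adding Patch, Isolate, and Unisolate to a defender's action set might involve, the following minimal sketch models each action as a state transition on a toy host record, together with a simple shaped reward. Everything here is an assumption made for illustration: the `Host` fields, the `apply_action` and `reward` helpers, and the penalty values are hypothetical and are not taken from CybORG's API or from the paper's actual design.

```python
from dataclasses import dataclass
from enum import Enum


class DefenderAction(Enum):
    """Hypothetical labels for the three new defender actions."""
    PATCH = "patch"
    ISOLATE = "isolate"
    UNISOLATE = "unisolate"


@dataclass(frozen=True)
class Host:
    """Toy host state; these fields are illustrative, not CybORG's."""
    vulnerable: bool = True
    compromised: bool = False
    isolated: bool = False


def apply_action(host: Host, action: DefenderAction) -> Host:
    """Return a new Host reflecting the assumed effect of one action."""
    if action is DefenderAction.PATCH:
        # Assumption: patching removes the vulnerability but does not
        # evict an attacker who is already present.
        return Host(False, host.compromised, host.isolated)
    if action is DefenderAction.ISOLATE:
        # Assumption: isolation cuts the host off from the network.
        return Host(host.vulnerable, host.compromised, True)
    if action is DefenderAction.UNISOLATE:
        # Assumption: unisolation restores connectivity.
        return Host(host.vulnerable, host.compromised, False)
    raise ValueError(f"unknown action: {action}")


def reward(hosts: list[Host]) -> float:
    """Toy shaped reward with hypothetical penalty weights."""
    compromised_penalty = 1.0  # cost per compromised host
    isolation_penalty = 0.5    # availability cost per isolated host
    return -sum(
        compromised_penalty * h.compromised + isolation_penalty * h.isolated
        for h in hosts
    )
```

In a sketch like this, the isolation penalty stands in for the availability cost of taking a host offline, so an agent is not rewarded for simply isolating everything. How the paper actually balances these terms is not stated in the abstract.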

artificial intelligence, machine learning, reinforcement learning

Country:
- Europe > Austria (0.04)
- North America > Canada
  - Ontario > Kingston (0.65)
  - Quebec > Montreal (0.04)

Genre:
- Research Report > New Finding (0.35)

Industry:
- Education > Educational Setting
  - Higher Education (0.41)
- Government
  - Military
    - Cyberwarfare (0.52)
    - Navy (0.41)
  - Regional Government > North America Government
    - Canada Government (0.51)
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (1.00)
  - Security & Privacy (1.00)