Learning Intrusion Prevention Policies through Optimal Stopping
–arXiv.org Artificial Intelligence
We study automated intrusion prevention using reinforcement learning. In a novel approach, we formulate the problem of intrusion prevention as an optimal stopping problem. This formulation allows us insight into the structure of the optimal policies, which turn out to be threshold based. Since the computation of the optimal defender policy using dynamic programming is not feasible for practical cases, we approximate the optimal policy through reinforcement learning in a simulation environment. To define the dynamics of the simulation, we emulate the target infrastructure and collect measurements. Our evaluations show that the learned policies are close to optimal and that they indeed can be expressed using thresholds.
arXiv.org Artificial Intelligence
Jun-14-2021
- Country:
- North America > United States
- Europe
- Sweden (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- Asia > Middle East
- Jordan (0.04)
- Republic of Türkiye > İzmir Province
- İzmir (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)