Recurrent Neural Network Language Models for Open Vocabulary Event-Level Cyber Anomaly Detection

Tuor, Aaron Randall (Pacific Northwest National Laboratory) | Baerwolf, Ryan (Western Washington University) | Knowles, Nicolas (Western Washington University) | Hutchinson, Brian (Western Washington University) | Nichols, Nicole (Pacific Northwest National Laboratory) | Jasper, Robert (Pacific Northwest National Laboratory)

Apr-6-2018–AAAI Conferences

Automated analysis methods are crucial aids for monitoring and defending a network to protect the sensitive or confidential data it hosts. This work introduces a flexible, powerful, and unsupervised approach to detecting anomalous behavior in computer and network logs, one that largely eliminates the domain-dependent feature engineering employed by existing methods. By treating system logs as threads of interleaved "sentences" (event log lines) to train online unsupervised neural network language models, our approach provides an adaptive model of normal network behavior. We compare the effectiveness of both standard and bidirectional recurrent neural network language models at detecting malicious activity within network log data. Extending these models, we introduce a tiered recurrent architecture, which provides context by modeling sequences of users' actions over time. Compared to Isolation Forest and Principal Components Analysis, two popular anomaly detection algorithms, we observe superior performance on the Los Alamos National Laboratory Cyber Security dataset. For log-line-level red team detection, our best performing character-based model achieves a test set area under the receiver operating characteristic curve of 0.98, demonstrating the strong fine-grained anomaly detection performance of this approach on open vocabulary logging sources.
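The scoring idea in the abstract (fit a language model to log lines online, then flag lines the model finds unlikely) can be sketched in a few lines. This is only an illustration under stated assumptions: a smoothed character bigram model stands in for the paper's recurrent networks, the log lines are invented for the example and are not taken from the Los Alamos dataset, and the names `CharBigramLM`, `online_scores`, and `roc_auc` are hypothetical.

```python
import math
from collections import Counter, defaultdict


class CharBigramLM:
    """Add-one smoothed character bigram model (a simple stand-in for an RNN LM)."""

    def __init__(self):
        self.counts = defaultdict(Counter)  # counts[prev_char][next_char]
        self.vocab = {"<eos>"}

    def update(self, line):
        """Fold one log line into the model (the 'online training' step)."""
        prev = "<sos>"
        for ch in list(line) + ["<eos>"]:
            self.counts[prev][ch] += 1
            self.vocab.add(ch)
            prev = ch

    def score(self, line):
        """Mean negative log-likelihood per character; higher means more anomalous."""
        symbols = list(line) + ["<eos>"]
        v = len(self.vocab) + 1  # +1 reserves probability mass for unseen characters
        prev, nll = "<sos>", 0.0
        for ch in symbols:
            ctx = self.counts.get(prev, Counter())
            p = (ctx[ch] + 1) / (sum(ctx.values()) + v)
            nll -= math.log(p)
            prev = ch
        return nll / len(symbols)


def online_scores(lines):
    """Score each line with the model as it stands, then train on that line."""
    model = CharBigramLM()
    scores = []
    for line in lines:
        scores.append(model.score(line))
        model.update(line)
    return scores


def roc_auc(scores, labels):
    """Area under the ROC curve: chance that a positive outscores a negative."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Invented log lines, for illustration only.
    normal = [
        "U12@DOM1,C17,C17,Kerberos,Network,LogOn,Success",
        "U45@DOM1,C22,C9,Kerberos,Network,LogOn,Success",
    ]
    odd = "zzzz!!!!qqqq"
    stream = normal * 50 + [odd]
    labels = [0] * (len(stream) - 1) + [1]
    scores = online_scores(stream)
    print("last normal score:", round(scores[-2], 3))
    print("odd line score:   ", round(scores[-1], 3))
    print("AUC:", roc_auc(scores, labels))
```

Working at the character level is what makes the vocabulary open: a user or machine name never seen before still gets a score, because it is built from characters the model already knows. Scoring each line before training on it means a line never influences its own anomaly score.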

artificial intelligence, data mining, machine learning

Country:
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.24)

Industry:
- Information Technology (0.73)
- Energy (0.53)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Anomaly Detection (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.60)
