Anomaly Detection in Large Scale Networks with Latent Space Models

Lee, Wesley, McCormick, Tyler H., Neil, Joshua, Sodja, Cole

Nov-13-2019–arXiv.org Machine Learning

We develop a real-time anomaly detection algorithm for directed activity on large, sparse networks. We model the propensity for future activity using a dynamic logistic model with interaction terms for sender- and receiver-specific latent factors in addition to sender- and receiver-specific popularity scores; deviations from this underlying model constitute potential anomalies. Latent nodal attributes are estimated via a variational Bayesian approach and may change over time, representing natural shifts in network activity. Estimation is augmented with a case-control approximation to take advantage of the sparsity of the network and reduces computational complexity from $O(N^2)$ to $O(E)$, where $N$ is the number of nodes and $E$ is the number of observed edges. We run our algorithm on network event records collected from an enterprise network of over 25,000 computers and are able to identify a red team attack with half the detection rate required of the model without latent interaction terms.

algorithm, bilinear mixed-effect model, probability, (16 more...)

arXiv.org Machine Learning

Nov-13-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - New Mexico > Los Alamos County
    - Los Alamos (0.04)

Genre:
- Research Report (0.64)

Industry:
- Information Technology > Security & Privacy (0.93)

Technology:
- Information Technology
  - Communications > Networks (1.00)
  - Data Science > Data Mining
    - Anomaly Detection (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.66)
    - Machine Learning > Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found