Anomaly Detection for High-Dimensional Data Using Large Deviations Principle
Guggilam, Sreelekha, Chandola, Varun, Patra, Abani
Most current anomaly detection methods suffer from the curse of dimensionality when dealing with high-dimensional data. We propose an anomaly detection algorithm that can scale to high-dimensional data using concepts from the theory of large deviations. The proposed Large Deviations Anomaly Detection (LAD) algorithm is shown to outperform state of art anomaly detection methods on a variety of large and high-dimensional benchmark data sets. Exploiting the ability of the algorithm to scale to high-dimensional data, we propose an online anomaly detection method to identify anomalies in a collection of multivariate time series. We demonstrate the applicability of the online algorithm in identifying counties in the United States with anomalous trends in terms of COVID-19 related cases and deaths. Several of the identified anomalous counties correlate with counties with documented poor response to the COVID pandemic.
Sep-28-2021
- Country:
- Europe
- North America > United States
- Michigan > Wayne County
- Wayne (0.04)
- Indiana > Wayne County (0.04)
- Nebraska (0.04)
- Wyoming > Albany County
- Laramie (0.04)
- Arizona (0.04)
- North Dakota > Grand Forks County
- Grand Forks (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Minnesota (0.04)
- Texas > Grimes County
- Anderson (0.04)
- New York > Erie County
- Buffalo (0.04)
- Michigan > Wayne County
- Genre:
- Research Report (0.40)
- Industry:
- Technology: