Entropy Estimations Using Correlated Symmetric Stable Random Projections
–Neural Information Processing Systems
Methods for efficiently estimating Shannon entropy of data streams have important applications in learning, data mining, and network anomaly detections (e.g., the DDoS attacks). For nonnegative data streams, the method of Compressed Counting (CC) [11, 13] based on maximally-skewed stable random projections can provide accurate estimates of the Shannon entropy using small storage. However, CC is no longer applicable when entries of data streams can be below zero, which is a common scenario when comparing two streams. In this paper, we propose an algorithm for entropy estimation in general data streams which allow negative entries. In our method, the Shannon entropy is approximated by the finite difference of two correlated frequency moments estimated from correlated samples of symmetric stable random variables. Interestingly, the estimator for the moment we recommend for entropy estimation barely has bounded variance itself, whereas the common geometric mean estimator (which has bounded higher-order moments) is not sufficient for entropy estimation. Our experiments confirm that this method is able to well approximate the Shannon entropy using small storage.
Neural Information Processing Systems
Mar-14-2024, 08:58:32 GMT
- Country:
- South America > Brazil (0.04)
- Europe > France (0.04)
- North America
- United States
- Wisconsin > Dane County
- Madison (0.04)
- Rhode Island > Providence County
- Providence (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.14)
- New York
- Tompkins County > Ithaca (0.04)
- New York County > New York City (0.04)
- New Jersey > Middlesex County
- New Brunswick (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- Wisconsin > Dane County
- Canada > British Columbia
- United States
- Asia > Middle East
- Israel > Haifa District > Haifa (0.04)
- Industry:
- Information Technology > Security & Privacy (0.67)
- Technology: