Robust High-Dimensional Mean Estimation With Low Data Size, an Empirical Study
Anderson, Cullen, Phillips, Jeff M.
Robust statistics aims to compute quantities to represent data where a fraction of it may be arbitrarily corrupted. The most essential statistic is the mean, and in recent years, there has been a flurry of theoretical advancement for efficiently estimating the mean in high dimensions on corrupted data. While several algorithms have been proposed that achieve near-optimal error, they all rely on large data size requirements as a function of dimension. In this paper, we perform an extensive experimentation over various mean estimation techniques where data size might not meet this requirement due to the highdimensional setting. For data with inliers generated from a Gaussian with known covariance, we find experimentally that several robust mean estimation techniques can practically improve upon the sample mean, with the quantum entropy scaling approach from Dong et.al.
Feb-16-2025
- Country:
- Asia (0.67)
- North America > United States (1.00)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Education (1.00)
- Food & Agriculture > Agriculture (1.00)
- Government (1.00)
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area > Neurology (0.68)
- Information Technology > Security & Privacy (1.00)
- Law (1.00)
- Leisure & Entertainment > Sports (0.67)
- Media (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning
- Neural Networks (0.92)
- Statistical Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Communications (0.92)
- Data Science (1.00)
- Human Computer Interaction (0.67)
- Modeling & Simulation (0.92)
- Security & Privacy (1.00)
- Artificial Intelligence
- Information Technology