A Statistical Perspective on Coreset Density Estimation
Turner, Paxton, Liu, Jingbo, Rigollet, Philippe
Coresets have emerged as a powerful tool to summarize data by selecting a small subset of the original observations while retaining most of its information. This approach has led to significant computational speedups but the performance of statistical procedures run on coresets is largely unexplored. In this work, we develop a statistical framework to study coresets and focus on the canonical task of nonparameteric density estimation. Our contributions are twofold. First, we establish the minimax rate of estimation achievable by coreset-based estimators. Second, we show that the practical coreset kernel density estimators are near-minimax optimal over a large class of H\"{o}lder-smooth densities.
Nov-10-2020
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Illinois > Champaign County
- Champaign (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- California > Los Angeles County
- Los Angeles (0.14)
- New York > New York County
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- Hungary > Budapest
- Budapest (0.04)
- France > Hauts-de-France
- United Kingdom > England
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Technology: