Coreset for Robust Geometric Median: Eliminating Size Dependency on Outliers
Fang, Ziyi, Huang, Lingxiao, Yang, Runkai
We study the robust geometric median problem in Euclidean space $\mathbb{R}^d$, with a focus on coreset construction.A coreset is a compact summary of a dataset $P$ of size $n$ that approximates the robust cost for all centers $c$ within a multiplicative error $\varepsilon$. Given an outlier count $m$, we construct a coreset of size $\tilde{O}(\varepsilon^{-2} \cdot \min\{\varepsilon^{-2}, d\})$ when $n \geq 4m$, eliminating the $O(m)$ dependency present in prior work [Huang et al., 2022 & 2023]. For the special case of $d = 1$, we achieve an optimal coreset size of $\tildeΘ(\varepsilon^{-1/2} + \frac{m}{n} \varepsilon^{-1})$, revealing a clear separation from the vanilla case studied in [Huang et al., 2023; Afshani and Chris, 2024]. Our results further extend to robust $(k,z)$-clustering in various metric spaces, eliminating the $m$-dependence under mild data assumptions. The key technical contribution is a novel non-component-wise error analysis, enabling substantial reduction of outlier influence, unlike prior methods that retain them.Empirically, our algorithms consistently outperform existing baselines in terms of size-accuracy tradeoffs and runtime, even when data assumptions are violated across a wide range of datasets.
Oct-29-2025
- Country:
- Africa > Rwanda
- Asia
- Afghanistan > Parwan Province
- Charikar (0.04)
- China > Jiangsu Province
- Nanjing (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- Russia (0.04)
- Afghanistan > Parwan Province
- Europe
- North America
- Canada > British Columbia
- United States
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- District of Columbia > Washington (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Arizona > Maricopa County
- Phoenix (0.04)
- Maryland > Baltimore (0.04)
- Colorado > Denver County
- Denver (0.04)
- Texas > Travis County
- Austin (0.04)
- California
- Genre:
- Research Report > New Finding (0.66)
- Technology: