AITopics | geometric median

We study the robust geometric median problem in Euclidean space $\mathbb{R}^d$, with a focus on coreset construction. A coreset is a compact summary of a dataset $P$ of size $n$ that approximates the robust cost for all centers $c$ within a multiplicative error $\varepsilon$. Given an outlier count $m$, we construct a coreset of size $\tilde{O}(\varepsilon^{-2} \cdot \min\{\varepsilon^{-2}, d\})$ when $n \geq 4m$, eliminating the $O(m)$ dependency present in prior work [Huang et al., 2022 & 2023]. For the special case of $d = 1$, we achieve an optimal coreset size of $\tilde{\Theta}(\varepsilon^{-1/2} + \frac{m}{n} \varepsilon^{-1})$, revealing a clear separation from the vanilla case studied in [Huang et al., 2023; Afshani and Chris, 2024]. Our results further extend to robust $(k,z)$-clustering in various metric spaces, eliminating the $m$-dependence under mild data assumptions. The key technical contribution is a novel non-component-wise error analysis, enabling substantial reduction of outlier influence, unlike prior methods that retain them. Empirically, our algorithms consistently outperform existing baselines in terms of size-accuracy tradeoffs and runtime across a wide range of datasets, even when data assumptions are violated.
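
The "robust cost" in the abstract above can be illustrated with a small sketch. It assumes the usual definition for the robust geometric median: for a center $c$ and outlier count $m$, sum the Euclidean distances from $c$ to all points except the $m$ farthest. The listing does not spell this definition out, and the function name `robust_cost` is hypothetical.

```python
import math


def robust_cost(points, center, m):
    """Sum of Euclidean distances to `center`, ignoring the m farthest points.

    Assumed definition of the robust geometric median cost; not taken
    verbatim from the paper.
    """
    if not 0 <= m <= len(points):
        raise ValueError("m must be between 0 and the number of points")
    # Sort distances ascending so the m largest (the outliers) sit at the end.
    dists = sorted(math.dist(p, center) for p in points)
    return sum(dists[: len(dists) - m])


# One far-away point dominates the plain cost but is discarded when m = 1.
data = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (100.0, 100.0)]
print(robust_cost(data, (0.0, 0.0), 1))  # 2.0
```

Under this definition, a coreset in the sense of the abstract would be a small weighted point set whose (weighted) robust cost stays within a factor $1 \pm \varepsilon$ of `robust_cost(P, c, m)` for every center $c$.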

data mining, dist, machine learning, (19 more...)

arXiv.org Machine Learning

2510.24621

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Austria > Vienna (0.14)
Asia > China > Jiangsu Province > Nanjing (0.04)
(20 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

MAC Advice for Facility Location Mechanism Design

Neural Information Processing Systems, Oct-10-2025, 20:14:08 GMT

We receive a prediction for each agent's location, and these predictions are crucially allowed to be only

approximation ratio, mechanism, prediction, (15 more...)

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.42)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.34)

Private Geometric Median

Mahdi Haghifam, Thomas Steinke, Jonathan Ullman

Neural Information Processing Systems, Oct-10-2025, 02:36:23 GMT

The predominant algorithm for DP convex optimization is DP (stochastic) gradient descent, or DP-(S)GD, for short [SCS13; BST14].

algorithm, equation, step follow, (17 more...)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.92)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Neural Information Processing Systems, Aug-17-2025, 02:22:51 GMT

We study first-order optimization algorithms for computing the barycenter of Gaussian distributions with respect to the optimal transport metric.

artificial intelligence, machine learning, optimization problem, (17 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(7 more...)

Industry: Government > Regional Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.50)

b9acb4ae6121c941324b2b1d3fac5c30-Paper.pdf

Neural Information Processing Systems, Aug-17-2025, 02:22:47 GMT

artificial intelligence, barycenter, machine learning, (17 more...)

Country: North America > United States > Massachusetts (0.14)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

One-shot Robust Federated Learning of Independent Component Analysis

Jin, Dian, Bing, Xin, Zhang, Yuqian

arXiv.org Machine Learning, May-28-2025

This paper investigates a general robust one-shot aggregation framework for the distributed and federated Independent Component Analysis (ICA) problem. We propose a geometric median-based aggregation algorithm that leverages $k$-means clustering to resolve the permutation ambiguity in local client estimations. Our method first performs $k$-means to partition client-provided estimators into clusters and then aggregates the estimators within each cluster using the geometric median. This approach provably remains effective even in highly heterogeneous scenarios where at most half of the clients can observe only a minimal number of samples. The key theoretical contribution lies in the combined analysis of the geometric median's error bound, aided by sample quantiles, and the maximum misclustering rates of the aforementioned $k$-means solution. The effectiveness of the proposed approach is further supported by simulation studies conducted under various heterogeneous settings.
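
The per-cluster aggregation step described above relies on computing a geometric median. A minimal sketch follows, using Weiszfeld's iteration, a standard technique that the abstract does not name, so it is not necessarily the authors' solver. The $k$-means step is omitted, and the function name `geometric_median` and its parameters are hypothetical.

```python
import math


def geometric_median(points, iters=200, eps=1e-12):
    """Approximate the geometric median of `points` by Weiszfeld's iteration.

    Illustrative sketch only: an iterate that lands exactly on a data point
    is handled by skipping that point, which is a simplification.
    """
    dim = len(points[0])
    # Start from the coordinate-wise mean.
    y = [sum(p[i] for p in points) / len(points) for i in range(dim)]
    for _ in range(iters):
        num = [0.0] * dim
        den = 0.0
        for p in points:
            d = math.dist(p, y)
            if d < eps:
                continue  # avoid division by zero at a data point
            w = 1.0 / d  # closer points receive larger weights
            den += w
            for i in range(dim):
                num[i] += w * p[i]
        if den == 0.0:
            break  # every point coincides with the current iterate
        y_new = [v / den for v in num]
        converged = math.dist(y_new, y) < eps
        y = y_new
        if converged:
            break
    return y


# Four estimates near the unit square plus one corrupted estimate far away.
estimates = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1000.0, 1000.0)]
print(geometric_median(estimates))
```

In this example the coordinate-wise mean is dragged to roughly (200, 200) by the single corrupted estimate, while the geometric median stays inside the unit square, which is the robustness property the aggregation scheme exploits.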

artificial intelligence, estimator, machine learning, (16 more...)

2505.20532

Country: North America > Canada > Ontario > Toronto (0.14)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Coreset for Robust Geometric Median: Eliminating Size Dependency on Outliers
