From Two Sample Testing to Singular Gaussian Discrimination

Santoro, Leonardo V., Waghmare, Kartik G., Panaretos, Victor M.

May-8-2025–arXiv.org Machine Learning

We establish that testing for the equality of two probability measures on a general separable and compact metric space is equivalent to testing for the singularity between two corresponding Gaussian measures on a suitable Reproducing Kernel Hilbert Space. The corresponding Gaussians are defined via the notion of kernel mean and covariance embedding of a probability measure. Discerning two singular Gaussians is fundamentally simpler from an information-theoretic perspective than non-parametric two-sample testing, particularly in high-dimensional settings. Our proof leverages the Feldman-Hajek criterion for singularity/equivalence of Gaussians on Hilbert spaces, and shows that discrepancies between distributions are heavily magnified through their corresponding Gaussian embeddings: at a population level, distinct probability measures lead to essentially separated Gaussian embeddings. This appears to be a new instance of the blessing of dimensionality that can be harnessed for the design of efficient inference tools in great generality.

artificial intelligence, machine learning, operator, (18 more...)

arXiv.org Machine Learning

May-8-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
- Europe
  - Switzerland > Vaud
    - Lausanne (0.04)
  - Russia > Central Federal District
    - Moscow Oblast > Moscow (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - India (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (0.48)
  - Supervised Learning > Representation Of Examples (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found