Kernel Robust Hypothesis Testing

Aug-5-2023–arXiv.org Artificial Intelligence

The problem of robust hypothesis testing is studied, where under the null and the alternative hypotheses, the data-generating distributions are assumed to be in some uncertainty sets, and the goal is to design a test that performs well under the worst-case distributions over the uncertainty sets. In this paper, uncertainty sets are constructed in a data-driven manner using kernel method, i.e., they are centered around empirical distributions of training samples from the null and alternative hypotheses, respectively; and are constrained via the distance between kernel mean embeddings of distributions in the reproducing kernel Hilbert space, i.e., maximum mean discrepancy (MMD). The Bayesian setting and the Neyman-Pearson setting are investigated. For the Bayesian setting where the goal is to minimize the worst-case error probability, an optimal test is firstly obtained when the alphabet is finite. When the alphabet is infinite, a tractable approximation is proposed to quantify the worst-case average error probability, and a kernel smoothing method is further applied to design test that generalizes to unseen samples. A direct robust kernel test is also proposed and proved to be exponentially consistent. For the Neyman-Pearson setting, where the goal is to minimize the worst-case probability of miss detection subject to a constraint on the worst-case probability of false alarm, an efficient robust kernel test is proposed and is shown to be asymptotically optimal. Numerical results are provided to demonstrate the performance of the proposed robust tests. Hypothesis testing is a fundamental problem in statistical inference where the goal is to distinguish among different hypotheses with a small probability of error [3]-[5]. The likelihood ratio test is known to be optimal under different settings, e.g., the Neyman-Pearson setting and the Bayesian setting [3], [5]. For example, for binary hypothesis testing, we compare the likelihood ratio between the two hypotheses with a pre-specified threshold to make the decision.

artificial intelligence, machine learning, probability, (14 more...)

arXiv.org Artificial Intelligence

Aug-5-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York
    - New York County > New York City (0.04)
    - Erie County > Buffalo (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Scientific Discovery (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found