E-Valuating Classifier Two-Sample Tests

Pandeva, Teodora, Bakker, Tim, Naesseth, Christian A., Forré, Patrick

Oct-24-2022–arXiv.org Artificial Intelligence

We propose E-C2ST, a classifier two-sample test for high-dimensional data based on E-values. Compared to $p$-values-based tests, tests with E-values have finite sample guarantees for the type I error. E-C2ST combines ideas from existing work on split likelihood ratio tests and predictive independence testing. The resulting E-values incorporate information about the alternative hypothesis. We demonstrate the utility of E-C2ST on simulated and real-life data. In all experiments, we observe that when going from small to large sample sizes, as expected, E-C2ST starts with lower power compared to other methods but eventually converges towards one. Simultaneously, E-C2ST's type I error stays substantially below the chosen significance level, which is not always the case for the baseline methods. Finally, we use an MRI dataset to demonstrate that multiplying E-values from multiple independently conducted studies leads to a combined E-value that retains the finite sample type I error guarantees while increasing the power.

artificial intelligence, experiment, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Oct-24-2022

arXiv.org PDF

Add feedback

Country:
- Europe > Netherlands
  - North Holland > Amsterdam (0.04)
- North America > United States
  - New York > New York County > New York City (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.93)
  - Representation & Reasoning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found