Safe Testing

Grünwald, Peter, de Heide, Rianne, Koolen, Wouter

Mar-10-2023–arXiv.org Artificial Intelligence

We develop the theory of hypothesis testing based on the e-value, a notion of evidence that, unlike the p-value, allows for effortlessly combining results from several studies in the common scenario where the decision to perform a new study may depend on previous outcomes. Tests based on e-values are safe, i.e. they preserve Type-I error guarantees, under such optional continuation. We define growth-rate optimality (GRO) as an analogue of power in an optional continuation context, and we show how to construct GRO e-variables for general testing problems with composite null and alternative, emphasizing models with nuisance parameters. GRO e-values take the form of Bayes factors with special priors. We illustrate the theory using several classic examples including a one-sample safe t-test and the 2 x 2 contingency table. Sharing Fisherian, Neymanian and Jeffreys-Bayesian interpretations, e-values may provide a methodology acceptable to adherents of all three schools.
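As a rough illustration of the combination property described in the abstract, the sketch below multiplies e-values from two studies and rejects the null when the product reaches 1/alpha. It is not the paper's GRO construction: it assumes the simplest possible setting, a point null against a point alternative for coin flips, where the likelihood ratio is a valid e-variable because its expectation under the null is 1. The function names and the parameters p0, p1 and alpha are hypothetical, chosen only for this example.

```python
import math


def bernoulli_e_value(outcomes, p0=0.5, p1=0.7):
    """Likelihood ratio of a simple alternative (p1) against a simple null (p0).

    Assumption for illustration: both hypotheses are single Bernoulli
    parameters, so the ratio has expectation 1 under the null.
    """
    log_e = 0.0
    for x in outcomes:
        log_e += math.log(p1 if x else 1 - p1) - math.log(p0 if x else 1 - p0)
    return math.exp(log_e)


def combine(e_values):
    """Combine e-values from successive studies by multiplication."""
    return math.prod(e_values)


def reject(e_value, alpha=0.05):
    """Reject the null when the evidence reaches the threshold 1/alpha."""
    return e_value >= 1 / alpha


# Two hypothetical studies, each with 8 successes out of 10 trials.
study_1 = [1, 1, 0, 1, 1, 1, 0, 1, 1, 1]
study_2 = [1, 0, 1, 1, 1, 1, 1, 0, 1, 1]

e1 = bernoulli_e_value(study_1)
e2 = bernoulli_e_value(study_2)

print(reject(e1))                  # first study alone
print(reject(combine([e1, e2])))   # both studies combined
```

In this toy setup the first study alone does not reach the threshold of 20, while the product of the two does. The decision to run the second study may depend on the first result without changing the threshold, which is the sense of optional continuation in the abstract.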

artificial intelligence, Bayesian inference, machine learning

Genre:
- Research Report > Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.67)
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (0.67)
