Safe Testing
Grünwald, Peter, de Heide, Rianne, Koolen, Wouter
arXiv.org Artificial Intelligence
We develop the theory of hypothesis testing based on the e-value, a notion of evidence that, unlike the p-value, allows for effortlessly combining results from several studies in the common scenario where the decision to perform a new study may depend on previous outcomes. Tests based on e-values are safe, i.e., they preserve Type-I error guarantees, under such optional continuation. We define growth-rate optimality (GRO) as an analogue of power in an optional continuation context, and we show how to construct GRO e-variables for general testing problems with composite null and alternative, emphasizing models with nuisance parameters. GRO e-values take the form of Bayes factors with special priors. We illustrate the theory using several classic examples including a one-sample safe t-test and the 2 × 2 contingency table. Sharing Fisherian, Neymanian and Jeffreys-Bayesian interpretations, e-values may provide a methodology acceptable to adherents of all three schools.
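As a rough illustration of optional continuation, the sketch below uses the simplest possible setting, which is an assumption of this example and not one of the paper's constructions: a simple null N(0, 1) against a simple alternative N(mu_alt, 1), where the likelihood ratio is an e-value. E-values from successive studies are combined by multiplication, and the null is rejected once the product reaches 1/alpha. The function names and the data are hypothetical.

```python
import math


def gaussian_lr_e_value(xs, mu_alt):
    """Likelihood-ratio e-value for H0: X ~ N(0, 1) vs. the simple alternative N(mu_alt, 1).

    Under H0 the expectation of this quantity is 1, which is what makes it an e-value.
    """
    n = len(xs)
    return math.exp(mu_alt * sum(xs) - n * mu_alt ** 2 / 2)


def combine_by_product(e_values):
    """Combine e-values from successive studies by multiplying them."""
    return math.prod(e_values)


def safe_test(e_values, alpha=0.05):
    """Reject H0 when the running product of e-values reaches 1 / alpha."""
    return combine_by_product(e_values) >= 1 / alpha


# Hypothetical data: the second study is run only because the first looked promising.
study_1 = [0.9, 1.4, 0.3, 1.1]
study_2 = [1.2, 0.8, 1.5]

e1 = gaussian_lr_e_value(study_1, mu_alt=1.0)
e2 = gaussian_lr_e_value(study_2, mu_alt=1.0)

print("study 1 alone rejects:", safe_test([e1]))
print("studies 1 and 2 combined reject:", safe_test([e1, e2]))
```

With these made-up numbers neither study alone reaches the threshold 1/alpha = 20, while the product does. The Type-I error bound for the product rule follows from Markov's (more generally, Ville's) inequality; the paper's GRO e-variables for composite hypotheses are more involved than this simple-versus-simple likelihood ratio.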
Mar-10-2023
- Country:
  - Europe
    - Netherlands
      - North Holland > Amsterdam (0.04)
      - South Holland
    - United Kingdom > England
      - Oxfordshire > Oxford (0.04)
  - North America
    - Greenland (0.04)
    - United States
      - Connecticut > New Haven County
        - New Haven (0.04)
      - Illinois (0.04)
      - New York (0.04)
- Genre:
  - Research Report > Experimental Study (1.00)