Early Stopping Based on Repeated Significance

Bax, Eric, Sarkar, Arundhyoti, Shtoff, Alex

Aug-1-2024–arXiv.org Artificial Intelligence

For a bucket test with a single criterion for success and a fixed number of samples or testing period, requiring a $p$-value less than a specified value of $\alpha$ for the success criterion produces statistical confidence at level $1 - \alpha$. For multiple criteria, a Bonferroni correction that partitions $\alpha$ among the criteria produces statistical confidence, at the cost of requiring lower $p$-values for each criterion. The same concept can be applied to decisions about early stopping, but that can lead to strict requirements for $p$-values. We show how to address that challenge by requiring criteria to be successful at multiple decision points.

criteria, decision point, probability, (16 more...)

arXiv.org Artificial Intelligence

Aug-1-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Florida > Palm Beach County
    - Boca Raton (0.04)
- Europe > United Kingdom
  - England
    - Greater London > London (0.04)
    - Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Health & Medicine (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found