Time-uniform confidence bands for the CDF under nonstationarity

Oct-9-2024, 22:15:20 GMT–Neural Information Processing Systems

Estimation of a complete univariate distribution from a sequence of observations is a useful primitive for both manual and automated decision making. This problem has received extensive attention in the i.i.d. We present computationally felicitous time-uniform and value-uniform bounds on the CDF of the running averaged conditional distribution of a sequence of real-valued random variables. Consistent with known impossibility results, our CDF bounds are always valid but sometimes trivial when the instance is too hard, and we give an instance-dependent convergence guarantee. The importance-weighted extension is appropriate for estimating complete counterfactual distributions of rewards given data from a randomized experiment, e.g., from an A/B test or a contextual bandit.

cdf, nonstationarity, time-uniform confidence band, (1 more...)

Neural Information Processing Systems

Oct-9-2024, 22:15:20 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence (0.88)